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Introduction 


This book arose essentially from two sources; a course called “Mathe- 
matiques ^lementaires Approfondies”, taught in 1972-1973 and 1973-1974 
at the University of Paris VII; and the author’s fifteen years of experience in 
preparing the geometry part of the orals in the Agr^gation de Mathematiques, 
a competition to select the best high school teachers in FVance. 

The main objectives pursued in this book, more precisely, in the frame¬ 
work of elementary geometry, are the following; 

— to emphasize the visual, or “artistic”, aspect of geometry, by using figures 
in abundance; 

— to accompany each new notion with as interesting a result as possible, 
preferably one with a simple statement but a non-obvious proof; 

— finally, to show that this simple-looking mathematics does not belong in a 
museum, that it is an everyday tool in advanced mathematical research, 
and that occasionally one encounters unsolved problems at even the most 
elementary level. 

Here are some particularities of this book that derive from the general 
principles above. Main definitions are, whenever possible, followed by non¬ 
trivial (and sometimes new) examples. Figures abound: at the beginning, 
especially, each geometric reasoning is iUustrated by a diagram. (Such was 
the general practice fifty years ago, but pictures have all but disappeared 
from modern geometry books. One reason seems to be that authors think 
that readers keep pencil and paper next to them and draw figures as they 
go along, or else draw mental pictures. But the author’s experience from 
university examinations shows that students aren’t likely to draw pictures, 
either on paper or in their heads. Thus one of the aims of this book is to 
teach the reader to make systematic use of figures as he reads.) 

Notes are also common, referring to both the historical development and 
the current, often very advanced, applications of the ideas introduced. This 
is meant to convey to the reader a feeling that the elementary mathematics 
that he is studying is an integral part of the living, continuing corpus of 
mathematical knowledge. The notes are backed by an extensive bibliography. 



XU 


Introduction 


References in brackets are to book and page number, except in the case of 
[B-G] and Bourbaki, which use a system similar to the one adopted in this 
book. Other exceptions are always clearly labeled. 

There are many internal references as well. The author believes that 
most readers will not read this book sequentially, from beginning to end. 
They will pick it up to find such and such a theorem, or a discussion of such 
and such a subject; or they will be “hooked” by a figure or result while leafing 
through the book, and then will want to read about the background of that 
figure or result. Either way their task will be facilitated by the many internal 
references and by the two indexes at the end, the second of which attempts 
to include all references to a given entry, not just the primary one. 

We will not dwell on the contents for long, but it’s worth mentioning a 
few results included here that are often absent from comparable books, or 
relegated to exercises: the fundamental theorem of affine geometry, the clas¬ 
sification of crystallographic groups, the classification of regular polytopes 
in arbitrary dimension, Cauchy’s theorem on the rigidity of convex polyhe- 
dra, the discussion of polygonal billiards, Poncelet’s theorem on polygons 
inscribed in a conic, the Villarceau circles on the torus, Clifford parallelism, 
the isoperimetric inequality in arbitrary dimension, the theorems of HeUy and 
Krasnosel’skii, the simplicity of the orthogonal group, the theorems of Witt 
and Cartan-Dieudonne, and complete expositions of spherical, elliptic and 
hyperbolic geometry. 

This book can be used in different ways. Besides consulting it as a 
source for problems and for its extensive bibliography, teachers can use it as 
a textbook for several courses: a freshman-level course on Euclidean geometry 
(based on chapters 8 and 9), a higher-level course on the sphere and hyperbolic 
geometry (based on chapters 18 and 19), a higher-level course on convexity 
from the geometric point of view (based on the whole or parts of chapters 11 
and 12). 

One last important point. Most of the exercises are more difficult than 
those in comparable books, since the text itself already contains a huge num¬ 
ber of ideas for exercises and applications. In particular, the reader often 
has to find out what to show. This wiU require from him a certain amount 
of maturity, a will to take risks and to develop his originality. Many of the 
* exercises, marked with a star, are completely solved in the companion volume 

[B-P-B-S]. 

This work owes much to the whole of the mathematical community, both 
students and colleagues. In particular, I’ve profited a lot from lectures by 
candidates to the Agregation, which showed me the right and the wrong 
way to teach things; I’ve also leant heavily on many colleagues for advice 
and references. Since I owe so much, I would not dare include a necessarily 
incomplete list of acknowledgements; I ask you all to accept my warm, global 
thanks. 

I would like to thank Silvio Levy for his rapid and excellent translation. 

M. Berger 



It was in 1897, 1 was then a third-year student 
at the ^ole Normale, and Joseph Caron, a teacher 
whose name I bring up with pleasure, had assigned 
us a very difficult exercise in descriptive geometry. 
It involved the intersection of two tori arranged in 
such a way that, to find an arbitrary point of the 
intersection, one would use sections of the tori by 
certain spheres bitangent to each of them. 

Joseph Caron taught descriptive geometry in 
Paris schools; he also held the chair of geometric 
drawing at the Sorbonne and the ^ole Normale. 
He has written a treatise on descriptive geometry 
and a manual on topographic projections (a reaction 
against programs that he condemns); he has pub¬ 
lished in the Bulletin de la Societe Mathematique de 
France an article about the construction of a third- 
order surface having its twenty-seven lines real (a 
plaster cast of this surface is part of the higher ge¬ 
ometry collection of the University of Paris); above 
all, he has contributed to instilling the love of geom¬ 
etry into numerous students, at a time when many 
eminent scholars, endowed with great geometric tal¬ 
ent, make a point of never disclosing the simple and 
direct ideas that guided them, subordinating their 
elegant results to abstract general theories which of¬ 
ten have no application outside the particular case 
in question. Geometry was becoming a study of al¬ 
gebraic, differential or partial differential equations, 
thus losing all the charm that comes from its being 
an art. 


Henri Lebesgue 
[LBl, 209-210] 



Chapter 0 

Notation and background 


0.1. Set theory 

If A and B are subsets of E, we denote their difference set by 
j4\S = {x £ E I X e a and x ^ B}. 

If f : E —* F is a. map and A C E, we denote by /|^ the restriction of / 
to A. The identity map from X into itself will be denoted by Idxi and the 
cardinality of X by #X. 

If {x,}, = i „ is a family, we write {xi,..., x,,..., x„} for the same 

family without the element x,-. 


0.2. Algebra 


We denote by N the non-negative integers, by Z the ring of integers, by 
R the field of real numbers, by C the field of complex numbers and by H the 
skew field of quaternions (see 8.9). The non-negative (resp. non-positive) real 
numbers are denoted by R+ (resp. R-), and the strictly positive (resp. neg¬ 
ative) reals by R^ (resp. Rl). H K is a field, K* denotes the set K \0. 

If E and F are sets endowed with the same kind of algebraic structure, 
Hom(£;r) and Isom(.E;F) stand for the set of homomorphisms and isomor¬ 
phisms, respectively, from E into F. However, if E and F are vector spaces, 
we write L(E;F) instead of Hom(£^', F) for the set of linear maps from E 
into F, and GL(F) for Isom(F;iE'), the so-called linear group of E. When 
two vector spaces are considered, they are vector spaces over the same field, 
unless we explicitly say otherwise. 

H a vector space E is the direct sum of two subspaces A and B, we write 
E = A ® B. The algebraic dual of a vector space E is denoted by E*. 
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We denote by I the identity matrix, and by *A the transpose of the 
matrix A. 

If X is a set we denote by Sx the group of permutations (that is, 
bijections) of X under composition of maps: fg = fog. When X = 
{1,2,we write S„ for Sx (the symmetric group of order n), and 
for the alternating group of order n, defined as the subgroup of formed by 
even permutations. The Klein group 1^4 is the product Z 2 x Z 2 of the group 
with two elements with itself. The dihedral group D 2 n, where n is an integer, 
is the extension by Z 2 of the cyclic group Z„, with relation ab = ba~^, where 
a (resp. b) is a generator of Z„ (resp. Z 2 ). 

Binomial coefficients are denoted by (") (see 1.5.2). 

All fields are assumed commutative, unless we state otherwise. 


0.3. Metric spaces 

Let X be a metric space with distance d. The diameter diam(A) g 
R+ U 00 of a subset A C X is defined as diam(A) = sup { d(x, y) | x, y € A }. 
The distance d(A,B) between two subsets A, B C X is the non-negative 
number 

d(A, B) = inf {d(x, y) | x € A, y e B }. 

We also write d(x. A) = d{{x},A). The distance £i(A, B) should not be 
confused with 6[A,B) (introduced in 9.11). 

The open ball of radius r around a point o is denoted by l/(o, r) ={ x e 
X I d{a, x) < r }, and the corresponding closed ball by B(o, r) = ( x e X | 
d(a,x) < r). Balls are also defined around sets: we put 

U(A,r) = {x e X I d(x,A) < r}, 

B(A, r) = {x € X I d(x. A) < r}. 

We also write Ux(-, ) and Bx(-, ) if necessary. 

If X, y are metric spaces, we denote by Is(X; Y) the set of isometries 
from X into Y, i.e., the set of maps f : X —* Y such that 

d{f(x), /(y)) = d(x, y) for x, y € X. 

In particular, we write Is(X) = Is(X;X). 

0.4. General topology 

We shall use several times the fact that the intersection of a decreasing 
Family of compact sets is non-empty (see a proof in 11.7.3.2). 




0.5. Hyperbolic trigonometry 
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0.5. Hyperbolic trigonometry 

The hyperbolic cosine, sine and tangent of a real number t are deSned 
by the formulas 

e* + e“* . e* — sinht 

cosht= ---, sinht = ---, tanh t = -r—. 

2 2 cosht 

The inverse of the restriction of the hyperbolic cosine to the positive real axis 

is called the principal hyperbolic arc-cosine; it is a map Arccosh ; (1, oo[—» 

R+. 

0.6. Lebesgue measure; integration theory 

In certain parts of the text we shall use results from Lebesgue measure 
and integration theory, especially the following fundamental theorems: the 
dominated convergence theorem, Fubini’s theorem, and the change of variable 
theorem for maps in open sets of R**. We shall also need the characteristic 
function XK of a set K, and the image of a measure under a map. A general 
reference for these topics is [RU]. 



Chapter 1 

Group actions: examples and 
applications 


In this chapter we define group actions, transitivity, stabiliz¬ 
ers, homogeneous spaces and faithful actions. These notions are 
abundantly illustrated with examples from algebra and particu¬ 
larly from geometry. These examples will be encountered again 
in later chapters. 

The last two sections, 1.7 and 1.8, are devoted respectively 
to crystallographic groups and finite groups of rotations around 
a fixed point in space. These two topics make full use of the 
language introduced in the previous sections. They were chosen 
for their esthetic interest and also because, in spite of the ele¬ 
mentary nature of the questions posed, their solution presents a 
certain degree of difficulty. Section 1.8 is closely connected with 
the theory of regular polyhedra in three-dimensional Euclidean 
space (section 12.5). 

The idea of considering the geometry of a space as the study 
of properties invariant under a group of transformations is due to 
Felix Klein, who formulated it in his famous “Erlangen program” 
(see, for example, [GGl, p. 367]). 



1.1. Group actions 


5 


1.1. Group actions 

1.1.1. Definition. Let G be a group and X a set. A G-actiqn on X is a 
homomorphism 4>: G —* Sx ■ If <i> is a G-aetion on X, we say that G acts or 
operates on X (by <f>). 

1.1.2. Notation. The action ^ is sometimes written (G, X, or even 
(G, X). H ^ is fixed, we write g(x) for 4>{g)[x), where g €: G and x 6 X. 

It follows from the definition that the map x —► g(x) b a bijection for 
every g G, and that g(h(x)) = {gh){x) for all g, h e G, x € X. Abo 
e(x) = X, where e b the identity of G, and 4>{g~^) = (^(g))~^. 

1.2. Examples 

1.2.1. G b a subgroup of 5x; i'his is the most frequent case. For example, 
G can be defined as the subgroup of Sx satisfying certain conditions. 

1.2.2. Put A = n}, Sa = Sn (the symmetric group). Let G = Sn- 

Then G acts on A; but it also acts in a natural way on X = Pn,p = { P C A | 
ffP — p }, the set of subsets of A with p elements (0 < p < n). 

1.2.3. For a given g £ S„, put G={g*|fceZ}c Then G acts on 

X={l,...,n}. 

1.2.4. If X b a vector space, its linear group 

G = GL(X) = {/ :X—♦X|/is linear and bijective } 

acts on X. 

1.2.5. Let £ be a Euclidean vector space (chapter 8), and put 

G = 0[E) = { / € GL(^) j / is an isometry }. 

Then there b a natural 0(X)-action on X — E. But 0(E) also acts on each 
grassmannian (manifold) 

X = Ge.,, = {V^ cXjyisa p-dimensional vector subspace of P }, 
and again on the set X of orthonormal bases of E. 

1.2.6. Let X = G be a group. Then G acts on itself in several important 
ways: 

4){g)(h) = gh (left translations); 

(f(g)(h) = hg (right translations); 

(f(g)(h) = ghg~^ (inner automorphisms). 

1.2.7. Let X be an affine plane (chapter 2), GA(X) = G the group of affine 
bijections of X (called the affine group of X), and r(X) the set of conics of 
X (a conic is a subset of X given by a certain type of equation, see 16.7). 
Then GA(X) acts on r(X). 
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1.2.8. Let X be a metric space, with distance d. The isometry group G = 
Is(X), defined in 0.3, acts on X in a natural way. 

1.2.9. Put G = R, X=5® = {xe R* | j|x|| = 1} C R^. Identifying R'* 
with C^, we define an operation (R,5®, 4’) by 

^(t)(2.a') = (e«a,e‘V). 

This example is of great importance in geometry: see 4.3, 6.2, 18.8.1. 

1.2.10. For other examples, see 4.5.9, 8 . 8 , 9.5, 18.10. 

1.3. Faithful actions 

1.3.1. Definition. The action (G, X, $) is called faithful if $ is injective 
(in other words, if only e € G maps to the identity Idx). 

This always happens when G C 5x (cL 1.2.1). If G is not faithful, one can 
consider the associated faithful action (G/Ker^,X,^). 

In section 1 . 2 , all actions are faithful, with the following exceptions; in 
1.2.9, we have Ker^ := 2nZ] and in 1.2.6, the action of inner automorphisms 
is faithful if and only if the center Za of G consists only of the identity. 

1.4. Transitive actions 

1.4.1. Definition. The action (G,X,4>) is called transitive if for every x,y 
in X there exists g € G such that g(x) = y. 

y 

X 

Figure 1.4.1 

In practice, it suffices to check that some fixed a ^ X can be mapped to 
any x €: X hy some element g & G. For if we have 9 ( 0 ) = x and h(a) = y, 
then y = {hg~^){x). 

1.4.2. Examples. We study the examples in 1.2 from the point of view of 
transitivity: 

— 1 . 2.2 is transitive; 

— 1.2.3 is transitive if and only if the permutation g is cyclic; 

— 1.2.4 is not transitive. Is it transitive on X \ 0 ? 

— 1.2.5 is not transitive on E (why?), but is transitive on any Ge,p (why?); 

— 1 . 2 . 6 : translations are transitive, but inner automorphisms are not (every 
point is fixed if G is abelian!); 





1.4. Transitive actions 
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— 1.2.7 is not transitive. For example, ellipses cannot be taken into parabo¬ 
las (see 15.3.3.2); 

— 1.2.9 is not transitive—see 18.8. 

1.4.3. Definition. Th.e. action (G, X) is called simply transitive if for every 
x,y €: X there exists a unique g G G suck that g{x) = y. 

1.4.4. Examples. 

1.4.4.1. Proposition. IfG is an abelian group, any faithful transitive action 
is simply transitive. 

Proof, ff for some x € X and g,h G G we have g{x) = h[x), then g(y) = h{y) 
for any y G G, since 

9(y) = g{l({x)) = k{g{x)) = *:(h(x)) = h(fc(x)) = h({/). □ 

1.4.4.2. Left translations (cf. 1.2.6} are simply transitive. 

1.4.4.3. The group 0(E) does not act simply transitively on Ge.p for any 
0 ^ P ^ dim E, but does act simply transitively on the set of orthonormal 
bases. 

1.4.5. Generalization. We say that (G, X) is p-transitive (p G N) if G 
acts transitively on p-tuples of distinct points of X. 

See examples in 2.3.3.5, 4.5.10, 4.6.9, 6.1.1, 9.1.6, 9.1.7, 9.6.2, 9.7.1, 18.5.5, 
18.8.4, 18.10.6, 19.4.5.1. 

1.5. Stabilizers; homogeneous spaces 

1.5.1. Definition. If (X, G) is a group action and X G X, the stabilizer or 
isotropy group of x is the subgroup G^ — { g G G \ g(x) = x }. 

Stabilizers in some sense tell us how far a group is from acting simply transi¬ 
tively: just notice that g[x) = h(x) if and only if is in the stabilizer of x. 

1.5.2. Examples. In 1.2.2, for X = A and X = 1, we have a natural 
isomorphism G] ~ 5„_i. For X = Sn.p, we have 

^(1 . 1 ') — ^ Sn-i>. 

These isomorphisms will allow us to count the number of permutations, 
ifSn = ti!, and of combinations, ifPn.p — (”)• See 1.5.8. 

For inner automorphisms (1.2.6) we have the following equality: 

^!7 ~ {h€G|hg=g/i}, 

the set of elements of G commuting with g. 

In 1.2.5, for X = Ge.p, we have the isomorphism Gi = G(p) x 0{n — p) 
(where n = dim E and 0(p) = 0(R'’)). 
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1.5.3. One always has 

^g{x) = gGxg~''- 

In other words, and Gx are conjugate subgroups of G, and in particular 

are always isomorphic. Example 1.4.4.1 is an immediate consequence of this 
remark. 

1.5.4. Definition. The set X is called a homogeneous space under G (for 
the action <j>) if (G, Jf, <f) is transitive. 

Let X be homogeneous under (G, X, 4>), and fix x e X. Define 6 : G —* X hy 
^( 3 ) = g{^)- Since g{x) = h(x) is equivalent to h~^g £ G*, we can pass to 
a map 9 : G/Gx —* X, where G/G^ is the quotient of G by the equivalence 
relation g ~ h <=>• h~^g £ G*; 

G G/Gx 

e\ 1/'» 

X 

(Observe that, in general, G/Gx is not a group.) By construction, 6 is bijec- 
tive. Thus we have, in the set-theoretical sense, 

set 

1.5.5 X s G/Gx. 

This equation is invaluable, for it reduces the study of the set X to an al¬ 
gebraic problem, namely, the study of the pair (G, G*). See, for example, 
IB-H), |WFJ. 

1.5.6. Corollary. If G is finite, so is X. Moreover, for every x € X, we 

feave (#G)/(#G,). □ 

1.5.7. It is important to notice that if G and X are topological spaces and 
^ : G X X —» X is a continuous map, the topological space X b not, in 
general, homeoinorphic to G/Gx (with the quotient topology). See 1.9.1 for 
a counterexample. 

1.5.8. Using an elementary reasoning (the so-called “shepherd principle”: to 
compute the number of sheep in a flock, count the number of legs and divide 
by four, cf. (BIO, 179]), one deduces from 1.5.2 and 1.5.6 that f/A = n = 
(#5n)/(#Sn-i), and hence, by recurrence, that j/Sn — Thus 

»e = 

^ (#5„)(#5n-,.) p!(n-p)!- 

1.5.9. From 1.5.2 and 1.5.5 one deduces the expression for the grassmannian 
Ge.ji as a homogeneous space G„,p = 0(n)/(0(p) X 0(n — p)). See some 
applications in [HU, chapter 18]; see also 14.3.7. 
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1.6. Orbits; the class formula 

We now study non-transitive actions. 

1.6.1. Definition. Let (G, Jf) be a homogeneous space. The orbit 0(x) of 
X G: X is defined as 

0{x) = { g{x) I G}. 

Thus orbits are just equivalence classes under the equivalence relation 

X ~ y <=>• there exists g €: G such that g[x) = y. 

In particular, we can form the space of orbits XfG. FVom 1.5.5 we deduce 
that, for all X € Jf, 

set 

1 .6.2 0(x) = G/G^. 


1.6.3. Corollary. If G is finite, so is 0[x) for every x, and 

#0(x) = (#G)/(#G»). 

Proof. This follows from 1.6.2 and 1.5.6. □ 

1.6.4. Examples. In 1.2.3, the orbits are called cycles of the permutation g. 
Their study leads to a complete classification of conjugate elements in 

In 1.2.5, for X = E, the orbits are the spheres 5(0, r) centered at the 
origin (where r £ R+). 



S(0, r) 


In 1.2.9 the structure of the set of orbits is fascinating. It is straight¬ 
forward to see that the orbits are aU circles (like = R/2TrZ), but less so 
to visualize how they all fit together to form the famous Hopf fibration: any 
two of the circles are linked, and the quotient space is homeoniorphic to the 
sphere (see 4.3.6). See 18.8 and 18.9 for different approaches to this very 
important example. 
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1.6.5. Note. Given an action (G,X), the problem of finding the orbits 
and parametrizing the set X/G can often be described as a classification 
problem. We shall see numerous examples of this: 2.7.5.11 deals with the 
examples in 1.2.6 and 1.4.2; sections 8.6 and 8.7 deal with angles; 13.1.4 with 
the classification of quadratic forms; 15.2, with affine quadrics; see also 18.6 
and 13.10. 

1.6.6. Corollary (the class formula). Let X and G be finite, and 
consider a G-action on X. If A G X is a subset which intersects each orbit 
in exactly one point, we have 

The set A is called a section of the map p : X —* XjG because it is the image 
s(X/G) of a section s : XfG —* X o{ p, i.e., a map such that p o s = Idx/a- 
One can say that A parametrizes the orbits. 


s 



1 i 

X/G 


0(x) 



Figure 1.6.6 


1.6.7. Application to p-groups 

1.6.7.1. Definition. A finite group G is called a p-group if ffG = p”\ where 
p is a prime and m £ N*. 

1 .6.7.2. Proposition. Every p-group has a non-trivial center. 

Proof. Consider the action of G on itself by inner automorphisms ( 1 . 2 . 6 ), call 
Za the center of G, and look at the induced G-action on X = G\Za (cf. 1.4.2, 
1.2.6). If A parametrizes X/G we have, by 1.6.6, 


#^=E 

x€A 


#G 

#Gx’ 


whence 

i/G = p^ = i/Za+Y.^- 

But i/G„ is a power of p, since it divides p™, and f/G/f/Gx = p*"*^*, where 
m(x) > 1 . Then p divides f/G and i/X, whence also f/Zo- CD 



1.6. Orbits; the class formula 


11 


1 .6.7.3. Notes. The proof of 1.6.7.2 is one of the keys to showing that every 
finite skew field is commutative; see, for example, [AN, 37]. 

A considerable refinement of the same argument also works in studying 
Sylow subgroups of a finite group of order p’^q (where p is a prime not dividing 
g); see [SEl, 139). 

1.6.8. We shall now conclude this chapter of generalities with two more 
elaborate studies involving plane and spatial geometry, respectively. Proofs 
will only be sketched; the interested reader can fill in the details himself or 
find them in the references given. 

1.7. Tilings and crystallographic groups 

1.7.1. A tourist visiting the Alhambra in Spain—just as the reader who 
leafs through the next few pages—is bound to develop an interest in plane 
figures with regularly repeated motifs covering the whole plane. The motifs 
admit an infinite number of variations, but there is only a finite number of 
ways of reproducing them—seventeen (see figures 1.7.4.1 through 1.7.4.5 and 
1.7.6.1 through 1.7.6.12). At the Alhambra only thirteen of these patterns 
are present; two of the remaining ones are found in the town of Toledo and 
the last two are represented in the native art of various tribes. The fact that 
animal or human motifs are not depicted in the Alhambra (unlike figures 
1.7.4.8 and 1.7.6.13, for example) is solely due to the Islamic ban on such 
representations. It may in fact be argued that this limitation on the devel¬ 
opment of pictorial motifs steered the artists’ creativity towards new ways of 
repeating the available ones, accounting for the surprisingly high number (for 
the time) of patterns which were used. 

The aim of this section is to give an axiomatization of these “regularly 
repeated motifs” and sketch a proof of the existence of only seventeen patterns 
(but see also 1.7.7.8). 

1.7.2. We begin our axiomatization by assuming that the artist fills his 
surface with copies of one standard tile. To simplify matters, we require at 
first that he use his tiles in any position, but without turning them over— 
maybe the tiles are decorated on one side only (see figures 1.7.6.1 through 
1.7.6.14). 

Let E be the Euclidean plane, and P <Z E a. subset in the shape of the 
standard tile. We want to express the fact that P and its copies fill the 
whole plane without leaving any gaps; thus axiom (CG2) in 1.7.3. This by 
itself, however, is insufficient to obtain only “regular” tilings such as those at 
the Alhambra, since there exist “non-regular” tilings, like the one in figure 
I.7.2.I. 

This example, due to Voderberg (jVGlj, |VG2]), is obtained from a nine¬ 
sided standard tile and its images under orientation-preserving isometries of 
the affine plane. What makes it work is the tiles’ remarkable property that, 
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Figure 1.7.2.1 (Source; |VGl]) 


when two of them are juxtaposed with only two vertices in common, the gap 
left between them can be filled with one or two other tiles. 

For that matter, the reader himself can construct a non-regular tiling 
by considering the squares formed by Joining adjacent points with integer 
coordinates in R^, and randomly dividing each square into two vertical or 
horizontal rectangles (figure 1.7.2.2). 
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Figure 1.7.2.2 


On the fascinating problem of “necessarily non-periodic tilings”, the 
reader may consult [RNlj, [RN2] (cf. 19.6.12), [RN3], [GA]. See also 1.9.16 
and figure 1.9.16.2. 
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In order to restrict ourselves to “regular' tilings, we must introduce a 
group of isometries of E, as in axiom (CGl) below. In figures 1.7.2.1 amd 

1.7.2. '2 no such group can be found. 

1.7.3. Axioms for tilings. A tiling consists of a connected compact 
subset P of the Euclidean plane E and a subgroup G of the group Is'^ (E) 

of orientation-preserving isometries of E, suck that the interior P of P is 
non-empty and the following conditions are satisfied: 


CGI) 

CG2) 


U 9{P) = E 

gen 

g{P) = h{P) whenever g(P) n h(P) / 0 


The group G is called a crystallographic group. 


1.7.4. Our aim here is to prove that, up to conjugation in the linear group 
of E, there exist only five crystallographic groups G, corresponding to fig¬ 
ures 1.7.4.1 through 1.7.4.5. (These figures depict fundamental tilings, which 
means that they satisfy the following axiom, stronger than (CG2): g(P) n 
h(P) ^ 0 implies g = h.) 



Figure 1.7.4.1 





Figure 1.7.4.2 



\a 





\a 
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\a 


\a 


r 




r' 




Figure 1.7.4.3 


Figure 1.7.4.4 



Figure 1.7.4.5 
(Source: |BD]) 
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While the number of crystallographic groups is finite, the tiles themselves 
admit of infinite variations (figures 1.7.4.6, 1.7.4.7 and 1.7.4.8). 




M. C. Escher, Perpetual Motion, 1953. Watercolor, 305 x 230 mm, 
Escher Foundation-Haags Gemeentemuseum (The Hague) 


Figure 1.7.4 8 (See [ER]) 
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1.7.5. Proof of 1.7.4. 

We shall use freely, sometimes without giving the exact reference, facts from 
plane affine Euclidean geometry found in chapter 9. Let E be the vector space 
underlying E, and Is'*’(£) —» GL(E} the homomorphism whose kernel T(E) 
is the group of translations of E (see 2.3.3.4 if necessary). Recall (cf. 9.3.4) 
that a map / € Is'^ {E)\T{E) is necessarily a rotation by some angle around 
its unique fixed point, called its center. The first key idea in the proof is the 
following: 

1.7.5.1. The group G acts discretely on E. This means that, for any a €: E, 
the orbit G(a) b discrete in E, i.e., it consbts solely of bolated points. (We 
recall that a point x of a metric space X b called isolated if there exists e > 0 
such that B(x,e)riX =■ {x}.) In particular, the intersection of any orbit with 
a compact set is finite, since a discrete compact set b finite. 

To prove the action is discrete, remark first that every compact set of E 
contains only a finite number of dbtinct tiles g(P). This comes from (CG2) 
and the fact that every sequence of points in a compact set has a convergent 
subsequence. Next, observe that the stabilber of a tile P' — g(P), defined as 
Gpi = { g € G I g(P') — P' }, is always finite. In fact, by 9.8.6.1 , there exists 
a' € E such that 


Gpi C G„i = {g € G I g{a') = a' }. 

Choose another tile P" such that the associated point a" given by 9.8.6.1 b 
different from a'. Since only the identity leaves both a" and a' fixed (see 9.1.6 
if necessary), the cardinality of Gp< is equal to the number of tiles g(P"), for 
g e Gpi. Since these tiles are dbtinct and contained in a circle of fixed radius, 
there are only finitely many of them (figure 1.7.5.1). 



Figure 1.7.5.1 
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Now for the proof that every orbit G(a) is discrete. Since G is a group of 
isometries, it is enough to show that a is isolated in G(a). Take an arbitrary 
17 > 0 . The number of points of G(a)nB(a, rj) lying in any given tile b finite, 
by the paragraph above, and the number of tiles intersecting B(a, ri) is abo 
finite, since they are all contained in the compact disc B(a,ri + £), where 6 
is the diameter of a tile. Thus G{a) n B(a,t]) is a finite set, and we can find 
€ such that G(a) n B(a,e) = {a}. □ 

Now consider the subgroup F = GnT(E) of translations of G. The second 
key point in the proof b the following property: 

1.7.5.2. The group F is a lattice, i.e., there exists a basis {tl, w} of E such 
that F consists exactly of translations by vectors in Zu + Zv. 

Proof. We start by proving, by contradiction, that F contains at least two 
linearly independent translations. Assume first that F = Id^, that b, G 
contains only rotations; if two such rotations r, s had different centers, their 
commutator would be a non-trivial translation, by 1.7.5.0. Thus 

the rotations of G all have the same center w, and Uggr contained in 

a dbc of fixed radius, contradicting (CGl) (figure 1.7.5.2.1). 



Now suppose that the directions of elements of F are all parallel. Take 
r e G\F and a translation < € F by a vector Then b a translation 

by the vector r(D, and, since r(D must be parallel to we conclude that r 
is a reflection through some point. All elements of G\F are thus reflections 
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through points; but if two reflections r, s have centers o, a', then sr is a trans¬ 
lation by the vector 2^ (figure 1.7.5.2.2). Thus the centers of all elements of 
C\r are on a line D parallel to the direction of the translations; this implies 
that U 36 O 9(P) is contained in a strip centered on D (figure 1.7.5.2.3), again 
contradicting (CGI). 

This shows that F contains translations by two linearly independent vec¬ 
tors. There remains to show that there exist u, v such that 

r = { translations by j tiJ 6 Zu -f- ZiT}. 

By dbcreteness (1.7.5.1), we can choose a translation in F by a vector u with 
minimal norm ||u||, then another translation in F by tT ^ Zu with minimal 
norm ||tT||. We will show that u and v are the desired vectors. 

Let Q be the parallelogram 

<3 = { a 4- tu + sw I t, s e [0, 1 ) }, 

for a fixed a & E. Since G is a group, and 

F 3 {translations by uJ j u; 6 Zu -f- ZtJ}, 

the images y(<3), y £ F, fill E. Thus, if there is a point in the orbit of a under 
F that is not in a 4 - Zu 4 - Zu, there is also one, which we call y, inside Q 
(figure 1.7.5.2.4). 


lattice 




We claim that the distance from y to one of the vertices of is strictly 
less than ||u|| or {ju|{; to see this, assume, without loss of generality, that y is 
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in the interior of the triangle {a,b = a + il,c = a + w}, and observe that the 
line {a,y) intersects the side {b,c) at a', and we have d(a,y) < d(a,a') (figure 
1.7.5.2.5). But then d(a, a') < (d(a,fe) + d(a, c))/2 = (l|u|| + ||wl|)/2, proving 
our claim, and contradicting the choice of u and v. □ 

1.7.5.3. Now let G' = G\r. If G' is empty, we’re in the situation of figure 
1.7.4.1; otherwise, the elements of G' are all true rotations, and they must 
be of finite order by (CG2). Assume first that they all have order two; then 
we’re in the situation of figure 1.7.4.2. Suppose now that r 6 G' has order 
a > 3, and assume also its angle is 2w/a, which is always possible. Call its 
center a. Let b ^ a be the center of a rotation in G' such that d[b,a) is 
minimal, and let s be a rotation of center b, order > S and angle 2n/fi. Put 
t = (rs)~*, so that ret = Idg. Now 9.3.6 shows that the center c of t b such 
that the angles of the triangle formed by a,b,c are half the angles of r,s,t, 
respectively (figure 1.7.5.3.1). 



Figure 1.7.5.3.1 


c 



We now show that the choice of b implies that the angle of f is 2 ?r/')f, 
where 7 is the order of t. In fact, if t had angle 27rn/7, n > 2, there would exist 
a rotation t' € G' with center t and angle less than 27rn/7. Then, applying 
the result from 9.3.6 mentioned above, it would follow that (rt')~* € G' has 
center a', a point closer to a than b is, contradicting the choice of b (figure 
1.7.5.3.2). 

Since the sum of angles of a triangle is tt, we obtain the basic condition 


ill 

0 + ^ + 7 


= 1 . 


But a>3, /3>3, 7>2 are all integers, so the only possibilities are the 
following; 



a 


Tf 

case I 

3 

3 

3 

case II 

2 

4 

4 

case III 

2 

3 

6 


We easily deduce 1.7.4, by showing that cases I, II and III correspond to the 
groups associated with figures 1.7.4.3, 1.7.4.4 and 1.7.4.5, respectively. 







1.7.7. Notes. 

1.7.7.1. The first rigorous proof that the number of crystallographic groups 
is equal to seventeen seems to be due to Fedorov (1891). For more historical 
or physical information, see [FT2, 38], |CRl, 279], and [H-C, chapter Ilj. 

Fedorov’s prime interest was in crystaUography. The most recent and 
complete reference on plane and spatial crystallographic groups is [BT]; [CR2] 
contains some very perceptive and precise historical remarks. See especially 
[SCI]; also 1.9.14, |SC2] and |B-B-N-W-Z]. 

















































Figure 1,7.6.11 


Figure 1.7.6.12 


Source; |BDj 


1.7.7.2. The problem of tilings can be extended to three-dimensional Euclid¬ 
ean space, and also to other, non-Euclidean, spaces, for example, spheres and 
hyperbolic spaces. Tilings of the sphere are treated in the next section 
(especially 1.8.6), and hyperbolic tilings in 19.6.12. 

1.7.7.3. For tilings of three-dimensional Euclidean space, see |BT). The com¬ 
plexity of the classification of crystallographic groups increases fast with the 
























































M. C. Escher—Sketch for a periodic tiling by horsemen 
India ink and wntercolor Escher Foundation-Haags Gemeentemuseum, The Hague 


Figure 1.7.6.13 


Figure 1.7.6.14. The tile P. 
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dimension; there are 230 crystallographic groups in three dimensions, 4783 
in four. The fact that their number is Suite in any dimension is non-trivial 
and is known as Bieberbach’s theorem; the proof is based on a generalization 
of 1.7.5.2, but is much harder. See |WF, 100]. 

1.7.7.4. It is a little easier to classify the crystallographic groups (in arbitrary 
dimension) that act with no Sxed points, i.e., such that the stabilizer of any 
point is trivial. The interest in such groups lies in that the quotient space 
E/G is locally homeomorphic to E, and is thus a differentiable manifold. In 
two dimensions, only two groups satisfy this condition: the group of figure 
1.7.4.1, which has as quotient the two-torus R^/Z^, and the group of figure 
1.7.6.3, whose quotient is the famous Klein bottle. For a classification in 
arbitrary dimension, see |WF, chapter 3); in three dimensions there are six 
such groups ([WF, 117]). 

1.7.7.5. On tilings and marginally related, but equally captivating, subjects, 
see [FTl] and [FT2|. 

1.7.7.6. For pictures similar to 1.7.4.8 and 1.7.6.13, see |FT2, plates I, II 
and III], |ER] and [MG], which is entirely devoted to Escher’s use of plane 
crystallographic groups. 

1.7.7.7. See also 1.8.7, 1.9.4, 1.9.9, 1.9.12. 

1.7.7. 8 . There are finer classifications, which lead to more types of tilings. 
See jB-P-B-S, 7], as well as the two articles |GR-SHl] and [GR-SH2], which 
discuss in detail two such classifications (one with 81 types, the other with 91). 


1.8. Tilings of the two-sphere and regular polyhedra 


1.8.1. One way of summarizing 1.7.5.3 is by saying that a rotation of finite 
order with center a gives rise to a regular tiling of a circle centered at o (figure 
1.8.1). Thus, tilings of the plane lead to the study of finite subgroups of the 
group of rotations around a fixed point, which is isomorphic to 0"*'(2) (see 
section 8.3). Analogously, the study of tilings of three-dimensional Euclidean 
space leads to the study of finite subgroups of C>'*^(3), or, which is the same, 
of tilings of the sphere S^. In this section we will only deal with the latter 
question; the reader can consult [BT] for the 230 crystallographic groups in 
three dimensions. 



Figure 1.8.1 
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1.8.2. Theorem. Up to conjugation in 0(3), finite subgroups o/0'*'(3) 
fall into five types: two families, indexed bp an integer n > 2, and three 
exceptional groups. 

We begin with a description of the mathematical approach to this problem. 
Then, in 1.8.3.4, we show how regular polyhedra fit in with our result and 
lead to a visual understanding of the subject. For more on regular polyhedra, 
see 12.5 and 12.6. 

1.8.3. Outline of proof 

1.8.3.1. The point of departure is a clever use of the class formula (1.6.6). 
Another method, using regular polyhedra and Euler’s formula, is given in 
12.7.4. In section 12.6 we shall classify regular polytopes in arbitrary dimen¬ 
sion. 

Let G C 0"''(3) be a finite subgroup, and consider its action restricted 
to the sphere S^. We introduce the set 

r = { (g, r) € (G \ c) X 5^ I g{x) = x }, 

which projects onto the set 

= { X 6 5^ I g(x) — X for some g 6 G \ e } 

of non-trivial fixed points of G (here e denotes the identity element). The 
latter can also be seen as the set of rotational axes of G \ e (see 8.4.7.1). We 
will compute ffV in two ways, first by summing over g, and then over x. 

Since each g € G \ e has exactly two distinct fixed points (see 8,4.7.1), 
we have ffT = 2[ffG— 1). To calculate if=X, we parametrise the orbits of the 
G-action on X C 5^ by a section A, as in 1.6.6. For x e A, we have (cf. 1.6.2) 

"" IW’ 

but ifGr is constant for y £ G(x), by 1.5.3. We call this constant Ux', it is none 
other than the order of a rotation generat.ing Gy. Thus #[Gy \ e) = i/j, — 1. 
Since Gy \ e = { g € G | (g, y) € F }, we conclude that 

#r = El- - i)#o(X) = El- - ■)# = E 

x^A x^A 

whence 

1.8.S.2 

1.8.3.3. If all the i/^ are large, each quantity 1 — l/i/j is close to 1, so the 
number of summands must be two or less, otherwise the sum would be greater 
than 2. Sharpening this observation a little bit, it is easy to conclude that 
the only possible cases are the ones listed in the tables below (where n is an 
arbitrary integer > 2); 
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1.8.3.4. Thus there are only five possible cases for the cardinality of the orbits 
and the orders of the associated stabilizers. There remains to show that all 
possible cases occur, and that each case corresponds, up to conjugation in 
0(3), to a unique group G. 

The uniqueness of G is not very difficult but requires some care; see [AS]. 
For existence, observe that; 

— Case I is realized by the cyclic group of order n generated by a rotation 
of order n in R^; the two orbits consist of the two points of 5^ lying on 
the common rotation axis (figure 1.8.3.1). 

— Case II corresponds to the subgroup of O"*' (3) leaving invariant a regular 
n-sided polygon drawn on a plane of R^ and centered at the origin. This 
group, called the dihedral group of order 2n (see section 0.2), contains n 
rotations (by angles 2irk/n) around the axis perpendicular to the plane 
of the polygon, plus n reflections through the lines joining the center 
of the polygon to its vertices and to the midpoints of the sides (figure 
1.8.3.2—watch out for a small distinction between the cases n even and 
n odd). The two points of situated on the axis of the polygon form 
a single orbit. A second one consists of the vertices of the polygon (if it 
was drawn inscribed in S^), and the third, of the radial projections of 
the midpoints of the sides on S^. 



Figure 1.8.3.1 Figure 1.8.3.2 


— Cases III, IV, V are realized by the subgroups of (3) leaving invariant 
a regular tetrahedron, cube and dodecahedron, respectively, all centered 
at the origin (figures 1.8.4.3, 1.8.4.4 and 1.8.4.5). The existence of the 
tetrahedron and cube is trivial. The same cannot be said for the existence 
of the dodecahedron; we refer the reader to 12.5.5. 





Icosahedron and dodecahedron drawn by Leonardo da Vinci 
for Fra Luca Pacioli'e De Divina Proportione. 

Figure 1.8.3.8 
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1.8.4. We remark (cf. 12.5.4 and 12.S.5) that the groups of the cube and 
the regular octahedronare the same, as are those of the regular dodecahedron 
and icosahedron. Also, the tetrahedron group is isomorphic to At, the group 
of the cube to 54 , and the octahedron group to .^ 5 ; see 12.5.5.6. 

1.8.5. It is a relatively simple task to investigate the finite subgroups of 0(3), 
an extension analogous to that of 1.7.6. We find, associated with case V, a 
group of order 120, whose corresponding tiling of 5^ is shown in figure 1.8.5. 



Figure 1.8.5 (Source; ICRl]) 


1.8.6. When studying the tilings of (as well as those of the hyperbolic 
plane, cf. 19.2), the key point, the analysis in 1.7.5.2, remains valid. Specif¬ 
ically, one considers three “rotations” r, s and f such that rst = Id. In all 
three geometries. Euclidean, spherical and hyperbohc, the angles of such ro¬ 
tations are of the form 2x/a, 2x/)0, 2 jr/ 7 , where a, 7 are integers; these 
angles, moreover, are twice those of the triangle formed by the centers of the 

three rotations. In the Euclidean case we always have 
spherical case. 


® ^ ^ ^ 7 ~ 


1.8.6.1 


J. 1 1 

Q + ^ + 7 > Ij 


by 18.3.8.4, and in the hyperbolic case 5 -t-^-|-i<lby 19.5.4. 

Formula 1 . 8 . 6.1 can be taken as the starting point for the classification 
of the types of finite subgroups of 0'*'(3); the number of choices for a,; 0,7 is 

clearly finite. On the other hand, the equation ^ < 1 is satisfied by 
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all but a few triples of integers and in 19.6.2 we shall see that all 

such triples do indeed give rise to a hyperbolic tiling. Thus, in contrast with 
the Euclideaui and spherical cases, the number of hyperbolic tilings is infinite. 
For references and additional material, see also 19.6.12. 

1.8.7. Group presentations. The dihedral group D2n (cf. 0.2 and 1.8.3.4) 
can be defined by two generators r, s, satisfying the two relations 

1.8.7.1 r" = = e, rs = sr"^. 

The three exceptional finite subgroups of O'*'(3), corresponding to regular 
polyhedra (1.8.3.4), can also be defined by two generators r, s satisfying the 
relations 

r*' = s** = (rs)^ = e, 

where p = 3, g = 3 for the regular tetrahedron, p = 3, g = 4 for the cube 
and p = 3, g = 5 for the dodecahedron. Observe that the existence of a finite 
group with each of these presentations is not obvious. 

Plane crystallographic groups can be presented in a similar way. The 
one corresponding to figure 1.7.4.1, for example, has two generators r, s and 
the relation rs = sr. We leave the others to the reader. 

Another way of describing the dihedral group is by using two generators 
r, s that satisfy only the relations 

1.8.7-2 = (rs)" = e. 

The presentation of the symmetry groups of polyhedra in 0(3) is similar: 
three generators r, s, t, satisfying the relations 

1.8. 7.8 r^ = s^ = = (rs)** = = (tr)^ = e, 

where p and q are as above. The order of these groups is 24, 48 and 120, 
respectively; see 12.5.4.1, 12.5.4.2 and 12.5.5.6. 

Observe the close similarity between 1.8.7.2 and 1.8.7.3; in fact, these are 
all cases of more general groups, defined as discrete subgroups of the group 
of isometries of an affine Euclidean space, whose generators are reflections 
through hyperplanes. The interest in such groups derives from the fact that 
they occur outside the bounds of Euclidean geometry; in particular, they play 
an essential role in the study of semisimple Lie groups, a field that has recently 
experienced a spectacular growth. The reader interested in this subject or in 
the details of the examples above should consult |C-M] (in particular pp. 38- 
51) and |BI4] (especially the historical note, pp. 234-240). 

1.9. Exercises 

1.9.1. Study the orbits of the Z-action on the circle 5^ C = C given 
by (fi(n){z) = e‘“" 0 , according to the nature of the real number a. If a 
is irrational, can this be used to construct the counterexample mentioned 
in 1.5.7? 
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1.9.2. Study the same question for the R-action on given by z') = 

(e'*z, a 6 R, where the notation is the same as in 1.2.9. 

1.9.3. Fill in the details of the proof of theorem 1.8.2. 

* 1.9.4. For each of the five crystallographic groups containing only proper 
motions, then for all seventeen, find the following: the order of the stabilizers; 
the structure of the group; the different types of orbits; a presentation for the 
group (cf. 1.8.7). 

1.9.5. Same question for the finite subgroups of O'*" (3) (resp. 0(3)) acting 
on S^. 

1.9.6. Draw fundamental tilings of under the action of groups of type I, 
11, III and IV (see 1.8.3.3 and figure 1.8.5). 



Figure 1.9.6 (Source: [CRl]) 


1.9.7. Show that in 1.7.7.4 we do indeed obtain as quotients the torus and 
the Klein bottle. 

1.9.8. Show that there exists an infinite number of distinct tilings of the 
circle 5*, of the sphere (see for example 18.8.1), and, in general, of any 
sphere 
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1.9.9. Make a comparative critical study of the classification of crystallo¬ 
graphic groups in [GR, 72-84], |H-C, 70-81] and jWL, 22-115]. 

1.9.10. Compute H=Ge,p (cf. 1.2.5} when E is a vector space of finite dimen¬ 
sion n over a finite field of k elements. 

1.9.11. Determine all compact subgroups of O'*"(3). 

* 1.9.12. Fundamental domains. Let G C ls{E) be a subgroup of the 
group of isometries of an affine Euclidean plane; we assume all orbits of G 
are discrete subgroups of E. For a fixed a & E, show that G and the set P 
defined by 

P = { X e P ] d[x,a) < d(x, g(a)) for any g 6 G } 

satisfy the axioms of a crystallographic group (1.7.3). 

1.9.13. Valencies of a tiling. Consider a plane tiling by convex tiles 
satisfying 1.7.3. Show that the number of points in a tile P which belong 
to at least three different tiles is finite, greater than or equal to three, and 
does not depend on P. Let this number be r, let the corresponding points, 
ordered along the boundary of P, be rui,..., m,., and call a, the number of 

tiles containing m,-. Show that the sequence .r is independent, up 

to a circular permutation and a reversal, of the tile P; we call it the sequence 
of valencies of the tiling. Show that one always has 



«=i 


(if necessary, check 12.7.2 and 12.7.4). Deduce that only 23 sequences of 
valencies are possible a priori. Draw tilings whose sequences of valencies are 
(3,3,3,3,3,3), (3,3,3.3,6), (3,3, 3,4,4), (3, 3, 4,3,4), (4,4.4,4), (3,6,3,6), 
(3,4,6, 4), (6,6,6), (4,8,8), (3,12,12) and (4,6,12). Finally, prove that the 
other sequences cannot be realized by any tiling. If necessary, see )GR-SHl], 
(GR-SH2] and the references therein. 

* 1.9.14. Does any triangle tile the plane? Any convex quadrilateral? Any 
quadrilateral? For pentagons, see jSC2j. 

1.9.15. Consider, in S3, the two transpositions s = (213) arid t = (132). 
Show that t o s has order 3. Deduce that if S3 acts on a group G by homo- 
morphisnis and some element g of G is invariant under s (i.e., s(g) = g) and 
anti-invariant under t (i.e., <(g) = g~^) we have g = g“*. For an application 
of this, see 9.5.4.9. 

* 1.9.16. Show that with the six tiles in figure 1.9.16.1 it is possible to tile the 
plane. Show also that any tiling with these tiles can never be periodic, i.e., the 
group of isometries which leaves the tiling globally invariant cannot contain a 
non-trivial translation. If you have trouble, consult ]RN 1], or make a sufficient 
number of photocopies of the tiles and start working with them. See figure 
1.9.16.2 for another necessarily non-periodical tiling, due to Penrose. 
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Chapter 2 

Affine spaces 


In this chapter we study affine spaces, the most frequent geo¬ 
metric spaces in this book. An affine space is nothing more than 
a vector space whose origin we try to forget about, by adding 
translations to the linear maps. It follows that the elementary 
properties of affine spaces, of their morphisms and of their sub¬ 
spaces are all properties from linear algebra, more or less dis¬ 
guised. As a consequence, most demonstrations are automatic: 
to prove a result about an affine space, the idea is generally to 
use some appropriate translation to reduce the statement to a 
vector form. This often means vectorializing the space at the 
appropriate point. 

From section 2.5 on, geometry again occupies the foreground, 
starting with the theorems of Thales, Pappus and Desargues. 
The only harder, but very elegant, result is the fundamental 
theorem of affine geometry, section 2.6, which states that a 
set-theoretical map between affine spaces which takes collinear 
points into collinear points is, in essence, an affine map. 

The end of the chapter is devoted to finite-dimensional real 
affine spaces. Among the results about such spaces that will 
be needed in the rest of the book is the following: a finite¬ 
dimensional real affine space (as well as its affine group GA(X)) 
has a canonical topology, and also a Lebesgue measure, unique 
up to multiplication by a scalar. Using this measure we associate 
to each compact subset K of an affine space X a point, called 
its center, which is invariant under automorphisms of X that 
leave K globally invariant. Along the same lines, we show that 
the subgroup of GA(X) formed by automorphisms that leave 
globally invariant a compact set K with non-empty interior is 
compact; conversely, every compact subgroup of GA(JVr) is con¬ 
tained in the stabilizer of a point of X. 

For more details on the proofs in sections 2.1 through 2.6, 
the best is to consult (FL|. 
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All vector spaces considered in this chapter are over a commutative 
field. When we consider together two vector or affine spaces, they will al¬ 
ways be spaces over the same field, with the single and important exception 
of section 2.6. 

It is an interesting exercise to study carefully where exactly the com¬ 
mutativity of the base field is used, and, when appropriate, to modify the 
statements of the results so as to render them valid for skew fields as well. 


2.1. Definitions 

2.1.1. Definition. An affine space over afield K is a faithful and transitive 
group action (X, .Y,$), where X is a vector space over K considered with its 
additive group structure. The vector space X is said to underlie the affine 
space X. 

2.1.2. Notation. We will generally denote affine spaces by X. We put 

i) = I + f. 

The maps are called translations of X; more precisely, $(1) is called a 
translation by the vector and will be denoted by t^. 



2.1.3. By 1.4.4.1, the action (X, X, $) is simply transitive, so there exists a 
map 0 : X X X —* X such that y = ^>{©(i, y),i) for all x, y S X. We set 
iy = 0(x,y), and sometimes say that xy is the free vector associated with 
the bound vector (i, ly), or with the pair (x, y). We also write 

2.1.4 xy=y-i; 

a certain justification for this notation is given in 3.1.7. The fact that $ is 
an X-action can be translated as follows; 

(x+() + Tf=x-^ (|’4 »f); 

we can simply write x + ^ + rf. In particular, 0 satisfies 

©I ; X 9 y 0(i, y) g X is a bijection for all x 6 X, 

0(x, y) + 0(y, z) = 0(x, z) for all x,y,z & X, 


2.1.5 
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since we have ©1^(1) = x + ^. The identity 0(x, y) + 6(y, z) = 6(1,2) is 
known as Chasles’ relation. 

2.1.6. Alternative definition. Let X be a non-empty set, X a vector 
space over K, and Q : X x. X X a map satisfying the conditions in 2.1.5. 
Then X is an affine space under the action $(D(x) = This definition 

is indeed equivalent to 2.1.1, for we have 6(x, x) = 0, 6(y, x) = —©(x, y), 
^ o $(D = Idx, and thus ^(rf) o = ^(r} + 1). 

The bijections ■ X —* X also satisfy the following relations: 

2.1.7 ©“^ o ©j, = $(xy), ©j, o ©J* : 2 2 + ^ for all x, y € X. 




Figure 2.1.7 


2.1.8. Definition. The dimension of X, denoted by dimX, is defined as 
the dimension of X. If dim X = 0, X is called a point; if dimX = 1, X is 
an affine line, and if dim X — 2, X is called an affine plane. 

2.1.9. Definition. If a €: X, we denote by X„ the set X endowed with the 
vector space structure such that ©« '. X —* X is a vector space isomorphism. 
We say that X„ is the vectorialization of X at a. 


2.2. Examples. Affine frames 


2.2.1. Take X = X and ^(D(^ = ^ + f) for all I", »j 6 X. In this way we 
obtain a natural affine structure on any vector space. 

2.2.2. H (X, X, $), (X', X', $') are two affine spaces, then so is (X x X', X x 
X', $ X $'), where 
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2.2.3. Let be a vector space, F a vector subspace of E, and X a translate 
of F, i.e., a coset of F under the additive group structure of E. Then X has a 
natural affine space structure (Jf, $), where $(/)(x) — x + f forallxeJf 
and f € F. It is important to observe that here the set X does not possess a 
distinguished point, contrary to the trivial example in 2.2.1, where the origin 
plays a special role. In particular, if is a hyperplane of E, giving a point 
in X is equivalent to giving a complementary subspace to F in £7, and, in 
the absence any structure other than that of vector space, there is no such 
thing as a “natural” complementary subspace. (See 2.2.5 for an example that 
elaborates on this remark.) This “homogeneity” is one of the motivations for 
the concept of an affine space. 


/ 


X 





Figure 2.2.3 


An important case of a space X as above is the set of solutions of a system 
of linear equations ctij-Xj = (t = l,...,fc). Indeed, this set is just the 

inverse image /“*(&) of the point b = (6i,..., 6fc) under the appropriate map 
/, and is thus a translate of the kernel /"^(O) = F. 

2.2.4. Let (X, .^,$) be an affine space and S C X be a vector subspace. We 
have an equivalence relation defined on X by 

X ~ x' •<==>■ X3? 6 S. 

Then (X/ has a natural affine structure, where is defined by 

passing to the quotients. We shall see in 2.4.2 what the equivalence classes 
are. 



Figure 2.2.4 
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2.2.5. Let £ be a vector space, and H C E a, hyperplane. Set 

Ey = { ly I W is a complementary subspace to H in E}. 

Observe that each ly is a one-dimensional vector subspace, and that, with 
the notation of 1.2.5, we can write Ey = GE^i\Gy i. Then we have the 
following 

2.2.6. Proposition. The set Ey has a natural affine structure, the under¬ 
lying vector space being Ey = L(E/H;H) (cf. 0.2). 



Proof. Observe that dim E/H = 1, whence 

dim Sy — dim H = dim E — 


We define the affine structure via the map 0 : Ey X Ey —* L(E/H\H). 
Let ly,ly' € Ey, a € E/H. H we denote hy p : E E/H the canonical 
projection, p~^(a) is an affine hyperplane of E (cf. 2.2.3); we know that p, 
restricted to an arbitrary complementary subspace to H, is an isomorphism, 
so we can put 

0{W,Vy')(a) = (pk')-‘(a) - (p|w)“‘(“) e H. 

See figure 2.2.6, where w' = (p|w’)”^(o), w = (p|w)~^(a). It is immediately 
seen that 0 satisfies axioms 2.1.5. □ 

The example above will be essential in section 5.1. It is natural to view this 
example and the one in 2.2.3 together (figures 2.2.3 and 2.2.6); the reader can 
demonstrate the following proposition, looking up the notion of isomorphism 
between affine spaces in section 2.3 if necessary. 

2.2.7. Proposition. Let E be a vector space, F a hyperplane of E and X 
a translate of X. Then the map W —* W n X gives a natural affine space 
isomorphism between Ey and X. □ 
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2.2.8. In order to be able to calculate in affine spaces, we introduce something 
similar to vector space bases. By convention, 

we define affine frames for finite-dimensional affine spaces only. 

2.2.9. Definition. An (affine) frame for the affine space X consists of d+ 1 

points {xi}j=o.i. d in X suck that {xo2:t}t=i. d is o, basis for X (thus we 

necessarily have d — dimX). The coordinates of x & X in this frame are the 

scalars {Ai}i=i.<i suck that xqx = AiXQii, i.e., the coordinates of xqx in 

the basis {x()Xi}i=i. ^ of X. fVe can write x = {Ai,...,Arf). 



Figure 2.2.9 

2.2.10. Remark. It amounts to the same thing to say that {x<}<=i. 4 is 

a basis for the vectorialization Xx„ (cf. 2.1.9), or that {xo, xi,..., xj} is a 
simplex of X (cf. 2.4.7). 

2.3. Affine morphisms 

Let (X, .^,0) and (X',.X',0') be two affine spaces (over the same field, 
cf. 0.2), and f : X —* X' a map (in the set-theoretical sense). 

2.3.1. Proposition and definition. The following conditions are equiv¬ 
alent: 

i) /eL(X,.;X}(a))/or some a £ X; 

ii) / e L(X„; X'j (a)) for all a € X; 

Hi) 0/(„) 0/0 0“* e L{X\X') for some a € X; 
iv) 0y(„) 0/0 0~* e L(X-,X') for all a € X. 

Moreover, if these conditions are satisfied, Qf{a)°f ^ L(X;X') depends 

only on f, and will be denoted by f or L(f). We have f o Q — Q o (f x f). 
A map satisfying the properties above is called an (affine) morphism, or an 
affine map, between X and X'. The set of suck maps is denoted by A(X;X'). 
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We say that f is an (aSne) isomorphism if f is a morphism and is bijective; 
f is an automorphism if f is an isomorphism from X into itself. □ 




1 - 


X 


/X/ 

f 


X' X X' 

I"' 

X' 


2.3.2. If f : X —* X' is an affine map, we have, for all x, y € X: 


f{x)f{y) = f(^)y f{y) = f(x) + f{^). 

The conclusion is that, heuristically, / consists of a translation and a linear 
map. 

2.3.3. Examples. 

2.3 3.1. H X = X' = R, we recover the well-known maps x —* ax + b, for 

a^b ^ R. 

2.3.3.2. Every constant map / is affine, and satbfies / = 0. Conversely, if / 
is affine and / = 0, then / is constant. 

2.3.3.3. The identity map / = Idx is affine, and / = Id^. For the converse, 
see below. 

2.3.3.4. If / e ^(X;X') and g € i4(X';X") we have go f e ^{X;X"), and 
9° f = 9° f- In particular, the set 

GA(X) = A(X;X)n5x 

is a group under composition of maps, and is called the affine group of X. 
We have a homomorphism 

L : GA(X) 9 / -* / e GL(X), 

where GL(X) = L(X;X) n is the linear group of X. The kernel of L 
(defined as L“*(ld^)) is evidently X^, the set of translations of X (cf. 2.1.2). 
It is a normal subgroup of GA(X), and will most often be denoted by T'(X) = 
Ker L. 

2.3.3.5. The affine group acts simply transitively on affine frames. This can 

be seen, for instance, by observing that for two affine frames {xi},=o.and 

{xJ},=o. d of X and X', respectively, there exists a unique / 6 A(X;X') 

such that /(x, ) = x' for i = 0,..., d. 

2.3.3.6. In particular, X is a homogeneous space for GA(X) (which is a big¬ 
ger group than X = T[X) used in the definition, 2.1.1). Denote by GAo(X) 
the stabilizer of a £ X in GA(X). Then (cf. 1.5.5) 

XSGA(X)/GA„(X) 

as sets, and the restriction L : GAa(X) —» GL(X) is a group isomorphism. 
To recover GA(X) from T(X) and GAa(X), we introduce the following 
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2.3.3.7. Definition. A group G is called the semidirect product of two of 
its subgroups H and K ifG=HK = {hlc\heH,ke:K}, H n K = {e} 
and H is normal. 

It follows that any g can be written as hA; in a unique way, that G — K ■ H 
and that g can be written as k'h in a unique way, but in general k ^ Id. 

2.3.3.8. Proposition. If a & X, we have a semidirect product 

GA{X) = T{X)GA„[X). 


Proof. The requbite map from GA(A') into T(X) x GA„(Jf) is evidently 

/•-□ 

f{a)a f(a)a 

2.3.3.9. For every a € X, A € K* = if \ 0, we call the map 


Ha.x : X —* a + Xax 


the homothety of center a and ratio X. Homotheties are affine maps, wd we 
have Ha.x = (use 2.3.2). Here the commutativity of K is essential. 


a 



Figure 2.3.3.9.1 



Figure 2.3 3.9.2 
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The converse is also true; 

2.8.3.10. Proposition. Let f e GA(Jf) be suck that f = where 

X & K \1. Then there exists a unique a such that f = Ha,x- 

Proof. For this, as for most not too subtle results about affine spaces, it is 
enough to vectorialize. Here we take an arbitrary 6 € Af and work in the 
vector space X(,. Finding a such that /(a) = o, is, by 2.3.2, equivalent to 
solving a = f{b) + A(a — 6). There is a unique solution 

a= ^(/(6)-Afc). □ 

We also have, for every a € X and X,/i K, 

2.3.3.11 = Ha.Xt.. 

When A = 1, proposition 2.3.3.10 doesn’t work, but then we know that / is 
a translation (cf. 2.3.3.4). In order to collect together all morphisms / such 
that / = A Id^ (A € if*), we work as follows: 

2.3.3.12. Proposition and definition. The center o/GL(X) is 
The inverse image under L (cf. 2.S.S.4) of K* Id^ is a normal subgroup of 
GA(A^), denoted by Dil(Jf). As a set, Dil(Af) is the union of T(X) and the 
set of Hn.x fora € X and X € K*\\; furthermore, T[X) is a normal subgroup 
of Dil(X). The elements o/Dil(Af) are called dilatations of X. □ 

The proof of the first assertion is exercise 2.8.4 (see 8.2.16 if necessary). In 

2.5.6 we shall see a nice geometric characterization of dilatations. 

2.3.4. Does A(X-, X') have a natural algebraic structure, similar to the vector 
space structure of L(X;X')? The interested reader can refer to (FL] to see 
how A(X', AT') is naturally made into an affine space of dimension 

dimA(A;A’') = (dimAf + 1) dim AT’. 

See 2.3.9 for a heuristic derivation of this value. 

2.3.5. Let E,E' be vector spaces, H,H' hyperplanes of E,E', respectively. 
Consider the associated affine spaces X = Eu, X' = E'jf, (see 2.2.5). Take 
/ € L(E\ E'); is there a derived morphism f : X —* X’l A necessary condition 
is that, for any one-diinensional vector subspace W of £ complementary to 
H, the image /(W) still be one-dimensional and transversal to H'. This is 
guaranteed by introducing the set 

2.3.6 Lh.H'{E;E') = { / e L{E-, Ef) \ f(H) C H' and / is injective }, 

where / € L{E/H',E'/H') is the quotient map, well-defined because f(H) C 
H'. For such an / we indeed have f[W) C E'^ for any W € Ejj\ the map 
thus defined is denoted by / : Ey —♦ Ejj,. The following proposition is 
straightforward: 
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Figure 2.3.5 


2.3.7. Proposition. For f e Lun‘(E,E') we have f e A{X\X'). The 
map L(f) = / : L{L[E/H, H), L[E‘/W, H‘)) is identical with 

T) f or)of~^. 

Moreover, f — g if and only if there exists k € K* such that g = kf. □ 


EjH 

E'/H' 

’I 

4- 

H 

^ H' 


For a converse of 2.3.7, see 3.2.1 and 5.1.3. The motivation for 2.3.7 comes 
in chapter 5. 


2.3.8. Use of coordinates. Let / e A[X-,X'), and fix two affine frames 

{2:i}i=().i.n and {a:'}j=o.i.|, for X and X' . Put e", = xoii, = x^iy, and 

introduce the matrix associated with /; 


M(f) 




From 2.3.2 we have 


/(x) = f[x„) + f(x„x) = f(xo) + > 


where x = (Ai,...,An). We can write x|,/'(xo) = whence AJ- = 

ay -I- Ylk where /(x) = (Aj,..., A|,). In other words, we have 


f{^) 
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This can be also expressed by the following product of matrices: 


/ M 




o 

o 


= M(/) 

>1 

, where M(f) = 

ai 


UpJ 

M(f) 

Va;,J 



W,, / 


This allows us to perform any calculations in practice, for example, computing 
the composition g o f of f € A(X-,X') with g € A[X'; X"). The true re<ison 
why this works will be given in 3.2.5. Finally, it is clear that the natural 
dimension of A[X\ X') is the number of parameters involved in M(f), which 
is equal to p + pn = p(n + 1) = dim A''(dim X + l) (cf. 2.3.4). 

2.4. Affine subspaces 

Affine subspaces are the first objects introduced in geometry: lines in 
the plane, lines and planes in space. From the mathematical point of view, 
however, it is preferable to introduce morphisms before subspaces. 

2.4.1. Definition and proposition. Let X be an affine space and Y a 
non-empty subset of X. The following conditions are equivalent: 

i) There exists a €.Y such that ©^(K) is a vector subspace of X; 

ii) For any a e K, ©<i(l^) w a vector subspace of X; 

iii) There exists a € K such that Y is a vector subspace of Xa! 

iv) For any a eY, Y is a vector subspace of Xa! 

v) There exist a € X and a vector subspace C X such that Y = o + K. 

If Y satisfies the conditions above we call it an (affine) subspace of X. The 
vector space 0(y x V) = Qa(F) = (a &Y arbitrary) is called the direction 
of Y and is denoted by Y ; this is consistent because (F, y,0 | K X Y) is an 
affine space. The injection i :Y X belongs to A(Y',X). The dimension of 
Y is its dimension as an affine space. □ 



Condition (e) displays very well the real nature of affine subspaces—they are 
vector spaces, up to translations (cf. 2.3.2). 

2.4.2. Examples. 

2.4.2.1. Zero-dimensional subspaces of X called points of X. One- and 
two-dimensional subspaces are (affine) lines and planes, respectively. Affine 
subspaces whose direction is a vector hyperplane of X are also called (affine) 
hyperplanes of X. 
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2.4.2.2. Let / € A{X',X'), let K be a subspace of X and Y' a subspace of 
X'. Then f(Y) is a subspace of X' and f~^(Y') is a subspace of X (if it is 

non-empty). Furthermore, S{Y) = f(Y) and f~^(Y') = 

2.4.2.3. There is a one-to-one correspondence between the afline subspaces 

Y of Eh (see 2.2.5) and the vector subspaces Y of E, where Y is the vector 
subspace of E spanned by Uwev Notice that dim? = dimK — 1, and 

Y = L(E/H-,YnH). 

Similarly, in example 2.2.3, there is a one-to-one correspondence between 
subspaces K of X and vector subspaces Y, where Y is the vector subspace of 
E spanned by Y. We then have Y — Y r\ F. 



2.4. 2.4. Let (y<)<6i t*® arbitrary family of subspaces of X. Then the 
intersection Hi ?« is either empty or a subspace; in the latter case we have 

i i 

The following result is a classical consequence of this: 

2.4.2.5. Proposition. Let S C X be an arbitrary non-empty subset of an 

affine space X. There exists a smallest subspace containing S, called the 
subspace spanned by S, and denoted by {S). It is equal to the intersection of 
all subspaces containing S. □ 

2.4.2.6. We conclude that (if dimX = d < oo) a (d-f l)-point subset 

{x,},=,o.i.<i of X is an afline frame if and only if (xq, xi,..., x,j) = X. 

2.4.3. Definition and proposition. The points {e,}i=(, .t of X are 

said to be (affine) independent if dim(eoi • • • i efe) = it. This happens if and 
only if ei ^ (ci,..., Cj, ..., e/t) (cf. 0.1) for every i = 0, ..., fc. □ 

2.4.4. For instance, two points x,y are independent if and only if x / y. 
In this case there is a unique line containing them, namely, (x, y). We thus 
recover one of Euclid’s axioms: two distinct points determine a unique line. 

2.4.5. Proposition. Let X be an affine space over a field of characteristic 

^ 2. In order that a non-empty subset Y of X be a subspace it is necessary 
and sufficient that (x,y) C Y for every x, y€ ?. D 





44 


Chapter 2. ASine spaces 



This, too, is an ancient axiom: a plan in space is a subset which contains the 
line joining any two of its points. The proof consists of vectorializing X at 
0 6 K, and calculating in X^. For x and y in Y, we have (0, x) = Kx C Y 
and X + y g {0i + y)/2) C Y. This doesn’t work in characteristic 2, and 

indeed the proposition is not valid in this case. 

2.4.6. Note. Let a, 6, c be three collinear points (i.e., lying on the same 
line). Then it makes sense to talk about the ratio ac/ab 6 K. 

2.4.7. SlMPLlCES. A simplex in an affine space X of finite dimension d is 
a set of d + 1 independent points of X. A two-dimensional simplex is called 
a triangle, and a three-dimensional simplex is a tetrahedron. The sides of a 
triangle {x,y,z} are the lines (x,y), {y,z), {y,t) and {z,x). The edges of a 
tetrahedron {x,y,z,t} are the lines (x,y), {x,^), (x,t), (y,z), (y,t) and {z,t), 
and its faces are the planes {x,y,z), {y,z,t), {z,t,x) and (t,x, y). Sometimes 
a set of three independent points in a space X of arbitrary dimension is also 
called a triangle. 




Figure 2.4.7 

2.4.8. Analytic geometry: equations defining subspaces. Here, 

as in 2.3.8, we develop tools to perform calculations related with subspaces. 
Recall (cf. 2.2.8) that dim X = d < oo. Recall also the following facts from 
linear algebra: 

2.4.8.1. Proposition. Let E be a vector space of dimension d; its dual E‘ 
has the same dimension. Let F be an arbitrary subset of E, and [F] the vector 
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subspace of E spanned bp F. Denote by 

= {/€£* j /(F) = 0} 

the orthogonal of F in E*. We have F"*- = (F)-*- and dim[F] + dimF-*- = d. 
If S is a subset of E*, its orthogonal in E is also denoted by S'"*’; 

S"*- = {x 6 F I /(x) = 0 for any / £ S}. 

Again, jS]-*- = S-*- and dim S-*- 4- dim[S| = d. If F, F' are vector subspaces of 
E and S is a vector subspace of E*, we always have 

F-*-*^ = F, S-*--*- = S, (FnF')-*- = F-*-+F'-'-, (F+F')-*- = F-'-nF'-*-. □ 


For infinite-dimensional vector spaces, refer to |BI5, section II.2]. 


2.4.8.2. Corollai^. Let H C E be a hyperplane. There exists f & E* \ 0 
such that H = /“*^(0), and if H — /~*(0) = 3~'(0) for f,g € E*, we must 
have g = \ f for some X € K". A subset F C E is a subspace of dimension 
p if and only if there exist linearly independent functionals fi, ■ ■ ■, fd-p 6 El* 
such that F = 0^=!' /r*(0)- ^ 


We can now move on to the affine case. The thing to watch out for is that 
the kernel of non-zero constant affine forms is empty, contrary to the case of 
linear forms, whose kernel always contains zero. 


2.4.8.3. Definition. .4n affine form over an affine space X is an element 
f £ A{X;K), where K has its natural affine structure over itself (cf. 2.2.1). 
We consider on A(X-,K) its standard vector space structure. 

2.4.8.4. For example, if {x,:}i=o. d is a frame in X, every affine form can 

be written in a unique way as 


I = (Ai,..., Xd) f{x) = a 4 ^ Oi Ai 

i 

(cf. 2.3.8). A constant is also an affine form, and its kernel (i.e., the inverse 
image of 0) is empty unless the constant is zero. 

2.4.8.5. Proposition. Let X be a d-dimensional affine space and V a non¬ 
empty subset of X. Then V is a p-dimensional affine subspace of X if and 
only if there exists a (d — p)-dimensional vector subspace V C A[X',K) not 
containing the constant affine form 1 and such that 

V = {x&X\ /(x) = 0 for any / £ V'}. 

Proof. The idea is to vectorialize Af at a £ V, and work in Xa- One direction 
is trivial: if V is an affine space, take V = y-*-, and 1 because l(x) = 

1 ^ 0 for any x. For the converse, we only have to prove that F'-*- ^ 0, and 
then apply 2.4.8.1 with E = X,,- We proceed by induction on k = dimF'; 
the case A; = 0 is trivial. Let {/i,-..,/fc} be a basis of V', and put Vj = 
F/i 4 ■ 4 Kfk-i. By the induction assumption, Vi = Vj-*- ^ 0, and we just 

have to show that fk\vi is not a non-zero constant. Assume fk is constauit 
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on Ki; then /fc(xo)/fc(x) = /fc(iox) = 0 for any Xo,x S: Vi. We cam write 
V( = Kfi + + Kfk-i, whence 

fc-i 

1=1 

which implies 

fc-i 

fk(x) - /fc(xo) + Yi 

i=l 

for any x €: X, since /.(xo) = 0, and finally /fc(xo) = 0. □ 

2.4.8.6. Examples. The most important example is when dim H = p = 1 
in 2.4.8.5 (hyperplanes). Then the proposition says that there exists an affine 
form / € A(X-,K) such that H = f~^{0); conversely, if / is a non-constant 
affine form, /~*(0) is a hyperplane, and finally 

r*( 0 ) = 3-‘{0) 

is equivalent to / = kg, for k € K*. 

The commonest practical cases are fif — R, d = 2 or 3, which correspond to 
our everyday physical space. If dimA^ = 3, for example, the equation of a 
plane Y in an arbitrary frame wiU be 

Y = {(x,y,z) I ax -I- 6y -I- cz + d = O}, 

where a, b, c do not all three vanish, and are otherwise arbitrary. In the same 
space X, a line D is written 

D = {(i, y, z) \ ax by + cz ^ d a!X b'y + c'z d! = 

where the triples (a, b, c) and (a', 6', c') must be non-zero and not proportional 
to one another. 

2.4.8.7. Parametric representations. A A:-dimensional subspace Y of X 

can be determined by giving a frame {x,}i=o.fc of Y. We have 


Y = 



A,- € K for all i = 1,..., A: 


}• 


Thus, a line in will be 


{ (a -I- Xb, a' + Ah', a" + Ah") | A £ R }, 

and a plane will be 

I (a -f- Ah -t pLc, a' ■+ Ah' + pc', a" 4- Ah" + pc") | A, p £ R 

2.4.8.8. All important tool for working in the set of (vector, affine, projective) 
subspaces is the use of “grassmannian coordinates”: see [HO-PE, chapter 
VII). 
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2.4.9. Parallelism and intersection of subspaces 

2.4.9.1. Definition. Two subspaces S,T of the affine space X are called 
paraiiej if S = T, and we write S j| T. They are called weakly paralJel if 
S C f, and we write S <T. 

We see that || is an equivalence relation, and < is a partial order. We will later 
recover the equivalence between these definitions and Euclid’s definitions for 
the case of lines in a plane and lines and planes in (three-dimensional) space, 
but we must first study the intersection properties of subspaces. 



Figure 2.4.9.1 


2 . 4 . 9 . 2 . Proposition. If S || T, either S=T or SnT = 9. If S <T, 
either S C T or S OT = 9. S <T if and only if there exists a subspace S' 
of T such that 5 || 5'. Let x & X and S C X be a subspace; there exists a 
unique subspace T such that -S' || T and i 6 T. Let S, T be subspaces of X. If 
5 n r = 0, 

dim S -f dim T < dim X -I- dLm(5 -I- f) 

and 

dim(5 U r) = dim(5 + f) I = dim S + dim T -f 1 — dim(5 O f). 

If SnT^ 9, 

dim(5 U r) = dim S dimT — dim(5 n T). 



Figure 2.4 9.2 
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Proof. This proposition follows immediately from Lemma 2.4.9.3 and the 
relation 

dim5 + dim f — dim(5 4- T) + dim(5 n f), 
which holds for any subspaces S, T of a vector space. □ 

2.4.9.3. Lemma. Let a & S and b & T. Then S nT ^ H is equivalent to 

^ e s + f. 

Proof. The lemma is shown by vectorializing at o (cf. 2.1.9). Notice also that 

(SuT)^S + f + K'^. □ 

2.4.9.4. Corollary. If S and T are complementary (i.e., X = S ®T), then 
S C\T has exactly one point, and S and T are also called complementaiy. □ 

2.4.9.5. Examples. If X is a plane, D C X a. line and x € D, there exists 
a unique line D' of X containing x and such that D n D' = 0, namely, the 
line parallel to D. We thus recover Euclid’s famous fifth postulate, where the 
notion of parallelism is formulated as a property of non-intersection. 

For d = 3, D a. line and P a plane of X, either D C P, or DnP consists 
of one point, or Z) n P = 0, in which case D < P. 



2.4.9.6. Projections and reflections. Let X be an affine space, S a 
subspace, and T a complement of S in X. By 2.4.9.2 and 2.4.9.4 there exists, 
for every x € X, a unique subspace T{x) with direction T and such that 
S n T(x) has a single point, which we denote by p(x). We thus obtain a map 
p : X —* S, called the projection from X onto 5, parallel to T. We have 
peA{X-,S). 

Similarly, the rejection s through 5 and parallel to T is the map s : 
X —* X defined by 

xp[x) = p(x)s{x) 

for all X ^ X. We have = sos = Idx and s 6 A(X\ Jf). In the nomenclature 
of 3.4.2, we can say that s(i) is defined by the property that p(x) is the 
midpoint of x and s(x). See exercise 2.8.5 for maps with the properties 
= Idx and = p. 

2.4.9.7. See |FL, 68] for the relationship between subspaces and fixed points 
of endomorphisms of X (i.e., elements of j 4(X; Jf)). 
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2.5.1. Proposition (Thales’ Theorem). Let H,H',H" be three dis¬ 
tinct parallel hyperplanes t» an affine space X, and (Di)i£i a family of lines 
of X, none of which is weakly parallel to H. Then the points di = H n Di, 
dji = H'nDi, d" = H"nDi (» € /), well-defined by S.4-9.4 and 2.4-6, satisfy 

the following condition: The scalar di<Tf /did\ does not depend o» i € /, but 
only on Conversely, if for some i we have d!" £ {di,d^) and if 

did"'/did[ is equal to this common value, then d[" = d" = H"nDi. 



(ip(H') 


|p(H") 

X/H 


Proof. We introduce the quotient space X/H (cf. 2.2.4) and the canonical 
projection p : X —* X/H, which is easily seen to be a morphism. In particular, 
2.3.2 shows that, for all t, 

^ _ p(g)p(j/»i 
pipin’) 

The converse comes from the fact that there exists a unique point c on a line 
(a, b) (6 ^ o) such that ac/ab is a fixed k E K. □ 

For another proof, see 6.5.4. 

All the rest of this section is a consequence of the following elementary remark; 

2.5.2. Lemma. Let a,b € X be distinct points and f a dilatation different 
from the identity. Set a' — f{a) and D — {a,b), and let V be the parallel to 
D through a' (cf. 2.4.9.2). If f & T{X), the point f{b) is the intersection of 
D' with the parallel to (a, a') through b. If f is a homothety of center 0, we 
have f[b) = D'n{0,b). 

Proof. The case of a translation is the famous “parallelogram rule”: ab = cd 

implies ac = bd, a consequence of the commutativity of X. If / ^ T{X), 
we consider the plane spanned by 0, a, b and apply Thales’ theorem, with the 
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lines D, D', D", where D" is the parallel to D through 0, playing the role of 
H, H', H". □ 

2.5.3. Proposition (Theorem of Pappus). Let X be an affine plane, 
D, D' two distinct lines of X. Let the points x,y,z B D and x', y' ,z' B D* he 
all distinct and not contained in DnlX. If{x,y') || (x',y) and {y,z') {| (y',z) 
then (x, z') || (x', z). 

Proof. If D, D' are not parallel, Dn D' contains a single point 0, by 2.4.9.5. 
Let / (resp. g) be the homothety of center 0 taking x into y (resp. y into z). 
By 2.5.2 and our assumptions, x' = f(i/), and g(z') = y'. But g ^ f = f ° g 
(use 2.3.3.11 and the commutativity of K), so, setting h — g o f ^ v/e have 
z = h(x) and x' = h(z'), whence (x, z') || (x', z) by 2.5.2. 



V D II D', the same proof works if we replace homotheties by translations. 

□ 

This is only one possible (affine) version of the Theorem of Pappus; for an¬ 
other one, see 5.4.2. For a converse, see 2.8.6. 

2.5.4. Proposition (Desargues’s theorem). Let {a,b,c),{a',b',c') 
be triangles in an affine space (cf. 2.4.7). Assume they have no vertices in 
common and that their sides are parallel: (a, 6) || {a',b'), (fc, c) || {b',c'), 
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(c,o) II (c'l a'). Then the three lines {a,a'), (6,6'), (c,c') are either parallel or 
concurrent. 

Proof. First, (a, 6) || {a',b') implies that the four points a, 6, o', 6' lie on the 
same plane. By 2.4.9.5, either (o, o') || (6,6') or (a, o') and (6,6') intersect 
in a single point 0. We discuss the second case. Let / be the homothety of 
center 0 taking a into o'. By 2.5.2, /(6) = 6'. Putting c" = /(c) and applying 
2.5.2 twice more, we get (6, c) || (b',c") and (a, c) || (a',c"), whence c' = c" 
by 2.4.9.2. □ 

For other versions of Desargues, see 5.4.7. 

2.5.5. Note. In axiomatic theories of affine and projective geometry (2.6.7), 
the theorems of Pappus and Desargues play an important role, and are some¬ 
times taken as axioms. In particular, Desargues’s and Pappus’ theorems are 
connected with the associativity and commutativity, respectively, of the base 
field. The interested reader can refer to 2.8.9, |AN, 73-75], and |DI, 158-160). 
See abo 2.6.7 and 4.8. 

2.5.6. Corollary (characterization of dilatations). Let f & Sx 

be a bijection of an affine space X of dimension > 2. Then f € Dil(Jf) if and 
only if, for any line D of X, f(D) is a line of X parallel to D. 


W(a) 


b«> 


«(b) 


D /(D) 


Figure 2.5.6 
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Proof. This follows from the proof of 2.5.4, by considering the dilatation /' 
defined by the points a,b, f(a), f{b), where a,b € X are arbitrary distinct 
points. The condition of the corollary implies f = f outside the line (a, 6). 
Since dimJf > 2, moving a and b around proves equality everywhere. The 
converse is trivial. □ 

2.5.7. See sections 5.4 and 5.5 for practical applications of the preceding 
results. 

2.6. The fundamental theorem of affine geometry 

This is the only delicate result of this chapter. In spite of its simple 
statement, its proof is long and involved. 

2.6.1. Introduction. We have seen (2.4.2.2) that if / e A{X\X') the 
image of every line D ot X \s a. line (or possibly a point) of X'. The converse 
is false. First of all, in the case dimX = dim X' = 1, any bijection satisfies 
this property. Next, take X = X' = and f{zi,Z 2 ) = (^ 1 , 02 ), where 
~ denotes complex conjugation. It is clear that / takes lines into lines; If 
Z) = a: + C z (see 2.4. 8. 7), then f{D) = x + C ■ z is still a line. 

Generally speaking, the fundamental theorem of affine geometry says 
that the two counterexamples just mentioned are essentially the only ones 
possible. Observe that complex conjugation is a (field) automorphism of C. 
Before stating the theorem, we introduce the following 

2.6.2. Definition. Let V,V' be vector spaces over K,K', respectively. 
A map f : V —tV'is called semiiinear if there exists an automorphism 
a : K —* K' such that 

/(Ax + fiy) = cr(A)/(i) + cr(/i)f{y) 

for all x,y & V, X, fi ^ K. If X,X' are affine spaces over K,K', a map 
f : X —* X' is called sejuiaffine if there exists a S X such that f : Xa —* 
is semiiinear. 

We have mentioned one example of a semiaffine map, taking (zi,Z 2 ) € 
into [zi,Z 2 ). The reader can, as an exercise, extend most things in section 2.3 
to the semiaffine case, and show that semiaffine maps do indeed take collinear 
points into collinear points. 

2.6.3. Fundamental theorem of affine geometry. Let X,X' be 
affine spaces of same finite dimension d > 2. Let f -. X —* X' he a bijec¬ 
tion which takes any three collinear points a,b,c e X into collinear points 
/(a),/(6),/(c) e X'. Then f is semiaffine. 

There are sharper results in [FL], for example. See also 5.4.8 and 5.4.9. The 
following proposition gives an idea of how powerful this theorem b: 
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2.6.4. Proposition. The only automorphism o/R is the identity. The only 

continuous automorphisms of C are the identity and z —* z (observe that C 
admits other automorphisms). □ 

See, for example, [FL, 88 ], exercise 2.8.10 and jPO, 48|. See also 8.12.11 for 
the automorphisms of the skew field of quaternions. 

2.6.5. Corollary. Let X,X' be two real affine spaces of same finite di¬ 

mension d > 2. If f : X —* X' is bijective and takes collinear points into 
collinear points, f is an affine map. □ 

2.6.6. Proof of 2.6.3. As we have mentioned, this is a longish proof; we 
restrict ourselves to the main ideas. See (FL, 83 ff.j for more details. 

2.6.6.1. Step 1. If the points {xv}<=o. k o*'® independent in X, their images 

are independent in X'. 

Proof. We complete {xi}i=o. k into a frame {xj}j=o. d of X. Assume the 

points {/(xi)}i=o.<i are not independent; then (/(xo),. •.,/(x^)) / X'. 

But the proof of 2.4.5 and the assumptions of 2.6.3 imply that 

f(X)c(f[xo),...,f(x,t))¥=X', 

contradicting the fact that / is surjective. □ 

2. 6 . 6 . 2. Step 2. For any line D of X, f(D) is a line of X'. If D and D' 
are parallel, so are f[D) and f(D'). 

Proof. Let D = (o, 6 ), and put 

D'= {f [a), f{b)). 

Let c' e D' and x € X be such that f(x) = c' (using the surjectivity of /). 
Then .x € (o, 6 ) by 2 . 6 . 6 . 1 . 

D' 



Now let D 11 D' be lines of X, and P the plane containing them (we can 
obviously suppose D ^ D', else there is nothing to show). Put P' = {/(P)); 
we have f(D),f[D') C f(P) C P'. By 2.4.9.5 we have f[D) jj f (D') if 
f{D) n f (D') — 0. On the other hand, if q € f{D) fl f{D'), we can find by 
the first part of 2. 6 . 6 .2 points c € D and c' € D' such that f{c) = f(c') = 7 . 
But D n D' = 9 and / is injective, a contradiction. □ 
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2.6.6.3. Step 3. The map f is additive. 

This means that /, considered as a map between the vector!alizations Xq and 
X'q,, where 0 € Jf and 0' = /(O) € X', satisfies /(x + y) = f(x) + f(y) for all 
X, y e Xo. 

Proof.- If I, y 6 Xd are linearly independent, x + y can be geometrically con¬ 
structed by using parallel lines (figure 2.6.6.3). By 2.6.6.1, f{x) and /(y) are 
also linearly independent, so the assumptions on / and repeated application 
of 2.6.6.2 show that f(x) + /(y) = /(x + y). If y and x are not independent, 
additivity follows from the next step. □ 



2. 6 . 6 . 4. Step 4. Construction of the isomorphism a •. K —* K'. 

Fix 0 & X, X € X\0 and let D = K x be the line spanned by x. Similarly, put 
0' = /(O), D' = f(D) = K'-f(x). The map cr : K —* K' is defined by diagram 
2. 6 . 6 .4.2: the restriction flu : D —* D' is indeed a bijection, by 2. 6 . 6 .2, and 
the bijections K —* D, K' —* D' are the maps A —♦ Ax and A' —> A'f(x). 
There remains to check that cr is a field homomorphism. The essential remark 
is again that (A -I- /x)x and (A/i)x can be geometrically constructed from Ax 
and fix, using parallel lines (figures 2 . 6 . 6 .4.1 and 2 . 6 . 6 .4.3), as long as we 
have a point y € X \ D. This is this case exactly because dim X ~ 2 by 

hypothesis. The figures are justified in the following way. figure 2.6.6.4.1, by 
the parallelogram rule (proof of 2.5.2), and figure 2.6.6.4.3, by the proof of 
2.5.3 and 2.3.3.11. □ 

2 . 6 . 6 .5. Step 5. The map f is semiaffine. 
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K’ 



The previous step showed that /, restricted to an arbitrary affine line of X 
through 0, is a semiaffine map, but a priori the isomorphism a D —* D' 
could depend on D. That it doesn’t is shown by figures 2.6.6.5, where D and 
Di are two lines through 0. □ 



2.6.7, AXIOMATIZATION of affine geometry. In a sense, the funda¬ 
mental theorem of affine geometry says that an affine space is essentially 
determined by its lines and their set-theoretical intersection properties. In 
other words, it is reasonable for one to expect to be able to reconstruct 
the full affine structure of a space (X, X, $) starting from X, its lines and 
their intersection properties. Such an approach, which amounts to finding 
the weakest possible axioms that must be imposed on X and its subspaces 
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in order to recover (X, X,$), is called an axiomatization of affine geometry. 
Euclid’s Elements, for example, propose such an axiomatization, though he 
adds also a metric structure, among others. There is an immense amount 
of literature about such constructions, which are still the object of research, 
generally together with axiomatizations of projective geometry. Open prob¬ 
lems arise particularly in dimension two, because then Desargues’s theorem 
does not follow from intersection axioms for subspaces. Roughly speaking, 
there exist more affine or projective planes in the axiomatic sense than in the 
sense of 2.1.1 or 4.1.1, for example, the projective plane of the division ring 
of octonions: see 4.8.3, |P0, 285], [BES, chapter 3|. 

Here are a few general references on axiomatic geometry: [AN, chap¬ 
ter 2] is a very readable and fundamental book; |BR] is very complete; |V-Y] 
is an old reference, useful for appreciating “modern geometry”. Recent refer¬ 
ences for projective planes include |DI] and [H-Pj. For a simple example, see 
|FL, 319]. 

2.7. Finite-dimensional real affine spaces 


In this whole section all vector and affine spaces are real (i.e., over R) and 
finite-dimensional. Their dimension is generally denoted d. 


Real affine spaces of dimension two and three are the objects of our everyday 
experience (without the notion of distance) and of classical geometry, as 
studied by the Greeks. Furthermore, finite-dimensional real affine spaces, 
sometimes endowed with an additional Euclidean structure, will be the main 
subject of this book. 

2.7.1. The canonical topology. We recall that a real (or complex) vec¬ 
tor space has a canonical topological structure (see, for instance, [CHI, 19]). 
Consequently, the linear group GL(E) of a vector space E of dimension d 
also has a canonical topology, induced by L[E; E), which is a vector space of 
dimension 

2.7.1.1. Proposition. Let X be an affine space, a E X. The topology 
induced on X by the canonical topology of the vectorialization X,, does not 
depend on a, and is called the canonical topology on X. 

Proof. Observe that translations of a finite-dimensional vector space are hom- 
eomorphisins, and apply 2 . 1 . 8 . □ 

2.7.1.2. For example, every morphism / e A(X; X') is automatically con¬ 
tinuous; every affine subspace Y is closed, and its complement X \ V, if not 
empty, is dense. 

2.7.1.3. If we don’t want to resort to the general result 2.3.4, we can proceed 
in the following way to find a canonical topology for GA(X): Take a 6 X, 
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write X as the semidirect product r(Jf)GAa(X) (2.3.3.8). We have isomor¬ 
phisms T(X) S X and GAa(A') ~ GL(J?), so we can successively consider 
canonical topologies on T{X), GAa(X), 7'(A’) GAa(X) and GA(yr). 

2. 7. 1.4. Proposition. The topology thus defined on GA(ylf) depends only 
on X, and is called the canonical topology. 

Proof. If / = sojf = s'og' with g £ GA„(A’) and g' € GA;,(X), we necessarily 

have s' = t -, and g' = o f. Thus s' and g' are clearly continuous at 

/(“)“ 

the point (s, g) = f. □ 

Thus GAa(A’) and T(X) are closed in GA(A’). The compact subsets of 
GA(X) will be studied in 2.7.5. 

2.7.2. Orientation. We recall briefly the notion of orientation for vector 
spaces. There are two equivalent definitions, one algebraic, using determi¬ 
nants, and the other geometric, using homotopy between bases. 

2.7.2.1. For the first definition, remark that the vector space A'^E* of 
alternating d-linear forms over a d-dimensional vector space £ is a real one¬ 
dimensional vector space; thus A^E* \ 0 has exactly two connected compo¬ 
nents. 

2.7.2.2. Definition. An orientation for E is the choice of one of the two 
components of A'^E*\0. Let this component be 0. A formal £ A'^E" is called 

positive if w & 0; a basis B = {e, },=i. 4 of E is called positively oriented 

if w(«i, ..., ed) > 0 for any ai & 0 . 

The last sentence makes sense because 01 , 01 ' £ 0 implies that 01 ' — koi for 
some k £ . 

2.7.2.3. Observe that if w € 0 and / € GL(£'), the pullback f*oi is in 0 if 
and only if det / > 0. Thus it makes sense to introduce the following notation 
(useful even if £ is not oriented); 

2.7.2.4. Notation. We put GL+(£;) = { / G GL(£’) | det / > O}, and 
GL"(£;)= {/£GL(£;) |det/<0 }. 

2.7.2.5. With this notation we have, for an arbitrary basis 0 oi E: If w £ 0 
and / £ GL"*^ (£'), then f'oi £ 0. If B is a positively oriented basis for E, the 
basis /(B) is positively oriented if and only if / £ GL"*'(.E). For this reason 
we call elements of GL'*' (E) orientation-preserving, and will do so even if .E is 
not oriented. On the other hand, if E, E' are real vector spaces of same finite 
dimension, the following definition only makes sense if E,E' are oriented: 
/ £ Isom''’(E; E') if and only if f*oi' £ 0 for any 01 ' £ O', where 0,0' are 
the orientations of E, E'. 

2.7.2.6. The second way of defining orientation is geometric, or even me¬ 
chanical; it essentially says that two bases have the same orientation if they 
can be deformed into one another; 

2.7.2.7. Definition. Two bases B,B' of a vector space E are called homo- 
topic if there exists a continuous map F : (0,1) —► E‘* such that E(0) = B, 
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f (1) = B' and F{t) is a basis for all t g [0,1]. An orientation for E is an 
equivalence class of homotopic bases. 

2.7.2.8. The only non-trivial result in this section consists in proving that 
2.7.2.2 and 2.7.2.7 are equivalent definitions. This is done by showing that 
there are exactly two equivalence classes in the space of bases, under the 
homotopy equivalence relation, and that if we choose an orientation for E 
in the sense of 2 .7.2 .2 the set of positively oriented bases is exactly one of 
these equivalence classes. Observe first that the choice of a fixed basis Bo of 
E determines a map (ji : GL(£) —* E^, 

Hf) = /(5o), 

whose image is the set of bases of E. This map is continuous, and, in fact, 
a homeomorphbm onto its image. Thus the desired equivalence follows from 
the 

2.7.2.9. Proposition. The space GL(£) has exactly two (path)-connected 
components, namely GL'*'(£^) and GL“(i7). 

Proof. There are at least two components since det : GL(.B) R* is a 
surjective continuous function. The path-connectedness of GL(.E) will be 
proved in 8.4.3. □ 

2.7.2.10. Definition. An orientation for an affine space X is an orientation 

for the underlying vector space X. We call {xj}i=o.i. d a positively oriented 

frame of the oriented affine space X if {so®<}i=i. d w a positively oriented 

basis for X. We put 

GA+(A') = {/ 6 GA(A') I f € GL+(X)}, 

GA"(X) = {/eGA(X) l/eGL-(X)}. 


2.7.2.11. One can ako define Isom'*’(X; X') if X,X' are two oriented affine 
spaces. One can talk about homotopic affine frames (whether or not X is 
oriented). In this way we obtain an equivalent notion of orientation; to see 
this we just prove that GA"*’(X) is still path-connected. 

For a € X, the group GA'*'(X) is the semidirect product T[X) GA‘''(X) 
(cf. 2.7.1.3), where T(X) is homeomorphic to X, which is path-connected, 
and GA,'J'(X) is homeomorphic to GL"*’(X), which is ako path-connected by 
2.7.2.9. 

2.7.3. HYPERPLANES AND HALF-SPACES 

2.7.3.1. Let ff be a hyperplane of an affine space X. By 2.4. 8 . 6 , there 
exists an affine form / € A(X;R) on X such that H = /~’(0). It is thus 
natural to consider the subsets /~*(R+) and /"^(R_), as well as f~^ (r;) 
and /■ -i(Rl). The first two are closed subsets of X, and the second two open 
subsets, whose closures are the first two. 
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2.7. 8 .2. Proposition. The sets f ) and f ^(R-) (resp. r^R;) 
and j *(R* depend only on the hyperplane H; they are called the closed 
(resp. open) half-spaces bounded by H. Every half-space is path-connected. 
The setX\H has exactly two connected components, /“^(R^) and /“*^(R1). 

Proof. The first assertion follows form the beginning of 2 .4. 8 . 6 . The second 
is true because half-spaces are convex sets; and the last, because X\H must 
have at least two connected components, since affine forms are continuous 
(2.7.1.2) and R* has two connected components. □ 



Figure 2.7.3.2 


Half-spaces play a fundamental role in chapters 11 and 12. 

2.7.4. LeBESGUE measure ON AN AFFINE SPACE 

2.7.4.1. We shall make free use of classical results from integration theory 
(see 0 . 6 ). 

2.7.4.2. Let £ be a vector space and / ; R** —* E a.a isomorphism, obtained 
by choosing a basis of E, for example. Let po denote the Lebesgue measure 
on R** and f[po) l^he image measure on E. If g : R'* —* E is another isomor¬ 
phism, we want to compare g(/.to) *ind f(po)- The change of variable theorem 
(cf. [GT, 33]) shows that 

g(/x„) = |det(g“^ o f)\f(po), 

where |det(g“* ° /)| is a strictly positive real number. Thus a vector space E 
does not have a canonical measure, but has a family of proportional natural 
measures f{p(i),f € Isom(R'*; .E), called Lebesgue measures on E. (This is 
the same as saying that E has a canonical measure up to scalar multiplica¬ 
tion.) 

The passage to the affine case is allowed by the observation that trans¬ 
lations of R‘* preserve po. Thus 

2.7.4.3. Proposition. A Lebesgue measure on an affine space X is the 
image, under any bijection ’. X—*X,a^X, of a Lebesgue measure 
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of X. Two Lebesgue measures on X are proportional. If pt is a Lebesgue 
measure on X, we denote by p, the Lebesgue measure on X which gives rise 
to it. For every Lebesgue measure fi on X and every f € GA(X) we have 
/(M) = |det/>. □ 

We shall see in section 9.12 that a Euclidean affine space possesses a canonical 
Lebesgue measure. 

2.7.4.4. Notes. Lebesgue measures are the only measures on an affine space 
invariant under all translations. The idea in proving this is to regularize the 
measure in question, so as to obtain a continuous invariant measure, which 
must be proportional to a Lebesgue measure. The proportionality factor, 
being invariant under translation, must be a constant. 

2.7.5. Center of a compact set. Compact subgroups of GA(X) 

In this section we use Lebesgue measures to associate to a compact subset 
K of an affine space X a well-defined point, fixed by every / e GA(X) such 
that f(K) = K. This latter property will be essential in 2.7.5.10. 

2.7.5.1. We first fix a Lebesgue measure p on X, and suppose that K has 
non-empty interior k. ^ In particular, 

/^(^) = / XKti > 0 , 

Jx 

where XK is the characteristic function of K (see 0.6). Recall that p. gives rise 
to a measure p with values in a vector space E by putting, for any f : X E 
and 4) € E*, 

*(//") 

This is the same as choosing arbitrary coordinates, in which / = (/i,..., fd), 
and writing 

Given the compact set K and an arbitrary point o G X, we set 

4(«) = / XKOxp. 

Jx€X 

If 1) is another point, 

= J XKbxp= j XK{ba^)p = p{K)ba + I^(a). 

We thus get the following 

2.7.5.2. Proposition. If K is a compact set in X with k 9, the point 

a={p{K))'^I^{a) 

does not depend on a € X or on the choice of p. □ 
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This point, called the centroid of K, shall be denoted cent'(iif), or cent-,f(/if) 
if necessary. 

2.7.5.3. Remark. If X is a Euclidean affine space of dimension 2 or 3, and 
if if is considered as a homogeneous (i.e., constant-density) plate or solid, 
cent'(if) is exactly the center of mass of K, as defined in mechanics. 

2.7.5.4. In the general case of an arbitrary compact if, possibly with empty 
interior, we generalize the definition above by using the following trick, bor¬ 
rowed from chapter 11; Take the convex hull £(K) of K and the affine sub¬ 
space (if) spanned by K. Then S{K) has non-empty interior in (if), by 
11.2.7, which allows us to state the following 

2.7.5.5. Definition. The center of a compact subset K of an affine space is 
the point 

cent(if) = cent(^j(£’(if)). 



2.7.5.6. Remark. This is no longer the center of mass considered in me¬ 
chanics (if if is, say, a curved homogeneous piece of wire in the plane or in 
space), cf. 3.4.2 and 3.7.14. Nor does it coincide with the other fixed points 
which we shall find in 9.8.6. 

2.7.5.7. Application. Let K be a compact subset of an affine space X. 
If f & GA(X) satisfies f(K) = K, then /(cent(if)) = cent(if). In other 
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words, 

GAk(X) = {/ e GA(X) I f{K) = K}c GAee„t(K)(^). 

Furthermore, if K ^ 9 we have \ det / | = 1 for any f £ GA/f (Jf). 

Proof. The last assertion is a consequence of the end of 2.7.4.3; 

= 4rHK)) = ,,(K) = I det/IMA-). 

To show the first we can, in view of 2.7.5.4 and 2.7.4.3, restrict ourselves to 
the case & ^ <6. Then 

/(cent(A-)) = f{a+{^i(K))-^I{a)) = f{a) + f{[^{K)r^I(a)) 

= f[a) + f{l[a)), 

where a € Jf is arbitrary. Using 2.7.5 .1 we get 

/(^(a)) = f( f XK^i^ = [ XKf{ax)fi^ 

\Jxex / JxeK 

= f XKf{a)f(x)fl=l{f(a)). 

Jx€K 

Thus 

/(cent(ff)) = /(a) + {^[K))''l{f{a)) = cent{A-). □ 

2.7.5. 8 . Notes. If X is affine Euclidean, one can show that every isometry 
/ € Is(A') such that f{K) = K leaves invariant a point of X which does not 
depend on /. This can be done without resorting to integration theory (see 
9.8.6). On the other hand, given a compact set K C X with A ^ 9, there 
exists a Euclidean structure on X which is invariant under GAjc(X), so in 
particular GAk(X) C Is(X) for this structure (see 2.7.5.10 and 8.2.5), 

Finally, observe that in general GAf(/f) ^ GA/f (X), as shown by figure 

2.7.5.8. 



2.7.5.9. Proposition. Let K be a compact subset of X with non-empty in¬ 
terior. The subgroup GA/f (X) C GA(X) is compact in the canonical topology 
(2.7.1.3). 
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The condition k ^ ^ vs necessary; if if = {o}, for a e X, we have GAa(X) 
homeomorphic to GL(.Y), which is not compact. 

Proof. Use 2.7.5.7 to vectorialize X at cent(ir), reducing the problem to the 
vector case. It can be shown that K contains the origin, i.e., cent(if) € K 
(2.8.11), but instead we observe that if f(K) = if for / e GL(Jf), we also 
have 

/(-if) = -if, 

where —if = { —x | x S if }, and 0 is an interior point of £ [K n (-if)) 
(figure 2.7.5.9). 



S(KU (-K) ) 

Figure 2.7.5.9 


Now consider any Euclidean structure on X, and take e > 0 such that 
B(0,e) C K. Put M = sup^gjf ||x||. Then /(if) = K implies 

||/(2:)|| < M for any x such that ||x|| < e; 

thus ((/II < e~^M in the usual norm of L(X;X). Thus GLjf (A”) is a bounded 
closed set of L(X\X), and hence compact. □ 

The converse is also true: 

2.7.5.10. Proposition. Let G be a compact subgroup of GA(A). Then 
there exists x X such that G C GA*(A’). Moreover, there exists a Euclid¬ 
ean structure on X left invariant by G (i.e., such that G C Is(A') for this 
structure]. 

Proof. Let a g X be arbitrary, and G(a) its orbit under G. The orbit is a 
compact subset of X, so we can take x = cent(G(o)). The “moreover” part 
will be show in three different ways (see 8.2.5 and 9.8.6). □ 

2.7.5.11. Corollary. All maximal compact subgroups of GA(X^) are conju¬ 
gate (in GA(X), under inner automorphisms). 

Proof. Consider the Euclidean structure on X given by a quadratic form q 
on X, and let Is,, [X, q) be the group of isometries of X (for this Euclidean 
structure) leaving a fixed. This group is compact, so 2.7.5.10 implies that 
every maximal compact subgroup of GA(X’) is of the form Isa(X’,g). The 
corollary follows from 8.1.6 and formula 1.5.3. □ 

2.7.5.12. Notes. Corollary 2.7.5.11 is included here as an aside; it is a 
particular case of a fundamental general theorem which says that all maximal 
compact subgroups of a Lie group are conjugate. See, for example, (HN, 218 
and note on page 240j. 

A result as general as 2.7.5.10 cannot be extended even to very well- 
behaved spaces. For example, take X as the sphere S'" and G = Is(X) 
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(cf. 18.5). Then G has no fixed point. For a general result in this direction, 
see 9.8.6.5. 

2.7.6. EQUIAFFINE geometry. At the end of 2.7.5.7 we hit upon the 
subgroup 

SA(Jf) = { / € GA(A-) I I det / I = 1 }, 

sometimes called the unimoduJar or special affine group. Its elements leave in¬ 
variant any Lebesgue measure on X, but SA(A') is much larger than Isom(A^) 
(for an arbitrary Euclidean structure on X), and is in fact not compact. How¬ 
ever, one can construct a fairly rich geometry dealing with properties invariant 
under SA(X); see |SK, vol. 2, p. 1-50 to 1-56), [FTl, 40-52), [BLA2), [DE4, 
volume IV, p. 367, exercise 12). For example, in 2.8.12 we have notions of 
length and curvature of curves of X that are invariant under SA(X). 

2.7.7. Differential calculus of affine spaces. The notion of dif¬ 
ferentiability can be easily extended from R'* to maps / : U X' from an 
open set U of an affine space X into another affine space X'. One way to 
do this is to remark that different vectorializations of X and X' differ only 
by translations, which are differentiable infinitely often; another way is to 
observe that the definition of differentiability uses the difference 

f(x) - f(a) - g(x-a), geL(X;X'), 

which makes sense (in X') since 

X - a — ax & Xiindf(x) — f{a) = f[a)f[x) £ jl'. 

The only thing to watch out for is that the derivative /'of / is in L{X; X'). 
A general reference for this subject k |CHl). 

2.8. Exercises 

* 2.8.1. Theorem of Ceva. Let a,b,c be a triangle in an affine plane, and 
let a' 6 (6, c),b' 6 (c, a), c' £ (a, b) be three points on the sides of this triangle. 
Prove that the three lines (a, a'), (6,6'), (c, c') are concurrent (or parallel) if 
and only if we have 

^ a'b b'c c'a ^ 

a'c 6'a c'6 

* 2.8.2. Theorem of ME^NELAUS. With the same data as in the previous 
exercise, show that the three points a',b',c' are collinear if and only if 

^ a*6 b'c c'a ^ 

a'c 6'a c'6 

Deduce the theorems of Pappus, Pascal (16.8.5) and Desargues from the 
theorems of Ceva and Menelaus. Generalize. 
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2.8.3. Let X be an affine space of dimension n over a finite field with k ele¬ 
ments. Compute #GA(X), and the number of p-dimensional subspaces 
of X. 

2.8.4. Show that the center of GL(jr), where X is a vector space, consbts 
only of K* • Idx • 

2.8.5. For X an affine space, study the maps / e GA(X) such that = Idx- 
Study the maps / such that = /. 

2.8.6. State and prove the converses of 2.5.3, 2.5.4. 

2.8.7. Let X be a complex affine space (i.e., over C). Show that X has a 
canonical topology. If ff is a liyperplane, is X\H connected in this topology? 

* 2.8.8. Show that if X is a finite-dimensional real affine space and K is a 
subspace the complement X \ K is connected if dim Y < dim X — 2. Is X \ K 
simply connected? (See 18.2.2.) 

2.8.9. Pappus and commutativity. Show that if an affine plane X, over 
a (possibly non-coiumutative) field, satisfies theorem 2.5.3 for all sextuples of 
points in X, then the base field is commutative. 

2.8.10. Prove proposition 2.6.4. 

* 2.8.11. Let X be a compact subset of a finite-dimensional real affine space 

0 

such that X ^ 0; let X be a hyperplane of X and X', X" its two closed half- 

o o » 

spaces. Assume also that H is such that X' ^ 0 and X / 0, where X' = Xn 
X' and X" = XnX". Show that cent'(X) is the weighted average of cent'(X') 
and cent'(X") with weights /-t(X') and ^(X"), or, in the terminology of chap¬ 
ter 3, the barycenter of the family {(cent'(X'),/z(X')), (cent'(X"),/i(X"))}. 

O 

Deduce that if in addition X is convex, then cent(X) € X. (See figure 2.8.11.) 

* 2.8.12. EquIAFFINE length and curvature. Let X be a real affine 
plane, and fix a basis for X. We can then define the determinant det(u, u) of 
any two vectors in X relative to this basis by writing the 2x2 matrix whose 
columns are u,v in this basis. Let c : (o,6] —♦ X be a differentiable curve of 
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Figure 2.8.11 


class C® in X. The equiafSne length of c (in X, relative to this basis) is the 
real number 

J a 

Show that c has same equiafline length as / o c for any / € SA(X) (see 
2.7.6). Show that we can reparametrize c by its equiafRne length if for every 
t we have det(c''(t), e'"(t)) 0. The equiaffine curvature is the number 

K = det(c",c''") when e is parametrized by its equiafline length; show that 
this curvature, too, is invariant under SA(A^). Find the equiafline length 
and curvature for an ellipse, a parabola or a hyperbola in X (always fixing 
a basis). In the same way that a curve in the Euclidean plane is determined 
up to an isometry by giving the curvature as a function of the arclength (see 
for example |CAR, 33], or jSK, vol. 2, p. l-lj), show that a curve in X b 
determined, up to an element of SA(X), by giving the equiaffine curvature as 
a function of its equiafline arclength. 

2.8.13. Find a polygon in an affine space, given the midpoints of the sides 
(watch out for the parity of the number of sides). Compare with 10.11.4 and 
16.3.10.3. 




Chapter 3 

Barycenters; the universal space 


In this rather technical chapter we construct for each affine 
space X a “universal” vector spaee X where X embeds as an 
affine hyperplane not containing the origin (section 3.1). In this 
model X is a vector hyperplane of X —namely, the direction of X 
in X. The construction of this universal space may at first appear 
to come out of the blue, but we hope the reader will soon be con¬ 
vinced of its usefulness. Indeed, we apply it to the study of many 
classical techniques and constructions: barycenters (section 3.4), 
barycentric coordinates (section 3.6), homogeneization of poly¬ 
nomials (section 3.3), completion of affine spaces into projective 
spaces (chapter 5) and the study of affine quadrics (chapter 15). 
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All fields considered are commutative. 


3.1. The universal space 

In an affine space X there is no intrinsic calculation with vectors, since 
X has no distinguished point (see introduction to chapter 2 and 2.1.9). It is, 
however, possible to calculate with vector fields on X, since the set of maps 
from an arbitrary set into a vector space has a natural vector space structure. 

3.1.1. Definition. A vector field on an affine space {X,X) is a map 
f : X —* X. The set of vector fields on (X, X) has a canonical vector space 
structure and will be denoted by V(X). 



3.1.2. Examples. Vector fields are generally drawn as arrows joining x to 
X + /(x), as in figure 3.1.1. We shall never consider wild vector fields as in 
this figure, but only the following two examples: 

3.1.2.1. The constant field f^, where ^ € X, is defined by /(x) = ^ for all 

X e X (figure 3.1.2.1). The trajectories of this field are lines parallel to 
Geometrically speaking, the map x i-t x + f(x) is the translation t^. 

3.1.2.2. The central Seld f[k,a\ associated with the pair (fc, a) € K* x X 
is defined by /(fc.a)(x) = kxa for any x 6 X (figure 3.1.2.2). The map 
X I—» X + /(x) is the homothety Ha.i-k, and the trajectories are open half¬ 
lines starting at the origin. See also 3.7.6. 

3.1.3. Notation. The following two subsets of 'V(X) will be used: 

C{X) = { f^ C'[X) = { f^k.a) I (k, a) e X* X X }. 
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Figure 3.1.2.1 Figure 3.1.2.2 


3.1.4. Observe that ^ * /j- defines a linear isomorphism between X and 

C{X), which is a vector subspace of V(A'). Analogously, (k,a) i—* f[k.a) is a 
bijection between K* x X and C‘(X), this latter not being a vector subspace 
of Observe also that, as elements of 'V{A'), the vector fields Xf^k.a) 

and S(xk.a) are identical, and finally that O C‘{X) = 0. 

3.1.5. Theorem. The union D(X) = C{X) UC'(X) is a vector aubspace of 
The map <f> ; P(X) — * K defined by 4>(f{k.a)) = k for [k,a) 6 K* X X 

and 4>(f^) — 0 for ^ & X is a linear form on P[X). The map x i-» f^l_x:) 
from X into D(X) induces an affine isomorphism between X and the affine 
hyperplane of P(X) (cf. 2.2.3). 

Proof. The proof is trivial, but the theorem is important. We first check that 
4> is linear: 



Fi^re 3.1.5 


3.1.6. The vector field f^k.a) + /(*'.«') takes x to kxa -f fc'xo ; thus, for 

k+k' = 0 we have fcxo+fc'x? = A:(ia—i?) = ka'a, so f[k.a) + f[k'.a') — f -f*- 

kn a 

On the other hand, if A: + it' ^ 0, there exists b such that 


ab = - -rroo' 

fc + fc' 


so that kxa + k'xa = (A: + k')xb, and /(*,„, + fik'.a') = /(fc+fc'.ft)- (The 
justification for this b comes from looking at the values of the two identical 
vector fields fik.a) + f[k'.a') and f[k+k'.b)i where fc is as yet unknown, at the 
point X = o.) 
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Similarly, one has (/(*.„) + fs){x) — kxa+ 1= kxa , where a' is such that 

^ Thus /(fc.a) + ff = fik.a')- The linearity of if> follows from these 

calculations. To show that E : x is an affine morphism, fix a e Jf. 

By 2.3.1, E is affine if and only if there exists a linear map ; X —» C[X) 
such that E(a)E(&) = E(fc) — £(a) (as vectors in P(X)). But, by definition, 
E(a)E(&) = E(5) — E(a) takes x into xb — xa = ah, so that E(&)E(a) = 
and the linear map E given by E{D = /^ satisfies the desired condition. □ 
We now forget about vector fields, and synthesize our results in the following 
(seemingly artificial) corollary; 

3.1.7. Corollary. Let (X,X) be an affine space, and let X be the disjoint 
union X U (K* x X). We identify X and 1 x X C X. Then X is a vector 
space with the multiplication given by 

k(h,x) = (kh,x) = (A:/i)x, O(fcx) = 0 

(in particular, (k,x) = kx), and the addition given by 

k' 

kx + k'x' = (k H- k')x" if k + k' 0, where x" = ^ ^ ^ x, 

kx + k'x' = kx'x if k + k' = 0, 

, kx + ^ = k(x + k” * ^ 

(and the original multiplication and addition on X C X). The set ft is a 

hyperplane of X. The map M : X —* X defined by M{kx) = k, M(D = 0 

for ^ X is a linear form on X. The offline hyperplane M“*(l) is affine 
isomorphic to X, and does not containO. For any a G X, we have a direct sum 
decomposition X = XffiXa. The restriction to X of the projection q : St —* X 
onto the first factor of this direct sum is identical with ©„ (cf. 2,1.5—recall 
that we identified X and 1 x X C n □ 



Figure 3.1.7 
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3.1.8. Remarks. Corollary 3.1.7 completely justifies the notation xy =■ y—x 
introduced in 2.1.3.1. Watch out for the dangers arising from the use of certain 
identifications; if there is any confusion, it’s a good idea to start over, carefully 
distinguishing all objects. A typical example is afforded by the affine space 
X = X (2.2.1): in X, the notation X means both the affine subspace 

X = X = M-^{1) 

and the vector subspace X (figure 3.1.8). Thus it’s a bad idea to write X = 
X\ we have to distinguish between the two when considering the associated 
universal space. 




x=x 

-•- 

0 


Figure 3.1.8 


3.1.9. The snake that bites its tail. Starting from X, we form the 
universal vector space X, which contains X as a hyperplane. According to 
2.2.5, we consider the affine space X^; then 2.2.7 and 3.1.7 show that X 
is naturally isomorphic to X^ via the two isomorphisms X = Af~*(l) and 

M-‘(l)sX^. 

3.2. Morphisms and the universal space 

Let X and X' be affine spaces. We search for the relationship between 
A(X;X') and L(X;X'). Going from L{X;X’) to A(X,X') is easy, just use 
3.1.7 and 2.3.7. The converse is given by the following proposition, whose 
proof is straightforward: 

3.2.1. Proposition. Let X,X' be two affine spaces, and f a morphism 

in A(X;X'). We define / : X —* X' by f\j^ = f, f[ks) = fc/(x) (using 
the identifications X C X, X' C X'). Then f G L(X;X'), and in fact 
f € j^,(X; X'). Using the notation of B.S.6 and the identification in 

S.1.8, we have f = f. If g ^ -^(X', X”), where X" is a third affine space, 
then g ° f ~ g ° f ■ D 

8.2.2. Thb justifies calling X a “universal” space, and shows that X —+ X 
is, in a sense to be made precise later, a functorial correspondence. 

3.2.3. Corollary. For every affine space X, the affine group GA(X) of X 
is naturally isomorphic to the quotient group GL^[X)/K* Td^ of GL^(X) = 
{ / e GL(X) I /(X) = X } by the center X* • Id^ o/GL(X) (see 2.3.3.12). 

□ 
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3.2.4. Example. With the definitions of 2.3.3.12, we have / € Dil(A^) if 
and only if /|^ = Id^. 

3.2.5. Elaborating on 2.3.8. We use the same notation and same data. 

Prom the affine frame {xi},=o. n for X we can form two bases for X. The 

first is {xo, xqXi, ..., xox„*}, where xo € X C X and xqx^ € X C X. The sec¬ 
ond is {xo, xi,..., x„} c X C X, and will be studied more closely in 3.6. For 

now we form the two bases {xq, xqXi, ..., xoXn} and {xq, XqXj, ..., Xgxl,} 
of X and X', respectively, and interpret 2.3.8 in this light. The matrix 
M(/) of 2.3.9 is simply the matrix of / (3.2.1) in these bases, for if x = 
(Ai, ..., A„) in X, the coordinates of x in the basis {xo, xqXi, ..., xox^} are 
clearly (1, Aj,..., A„), and the elements of X C X are exactly those whose 
first coordinate is zero. This proves 2.3.9. 




3.3. Polynomials over an affine space 


In this section fields have characteristic zero. 

We recall the definition and some properties of polynomials and polynomial 
maps on vector spaces. A very good reference is [CHl, 79). 

3.3.1. Definitions. Let V,W be vector spaces. A map f : V —* W is 
called a homogeneous polynomial map of degree k if there exists a symmetric 
k-linear map 4> '■ F* —* such that f = ^ o A, where A : V —> V*' is 
the diagonal map V(x) = (x,...,x). We denote by the set of 

homogeneous polynomial maps of degree k from V into W. If W = K, the 
base field of V, we call f a homogeneous polynomial of degree k, and put 
Pl^(V) = If f is a homogeneous polynomial map, the map 4> such 

that f = (f> o A is uniquely determined by f. A map f -.V W is called a 
polynomial map of degree less than or equal to k if there exist fi € T/(V; W) 
(i = 0,...,k) such that f = fi (by convention, 

constant maps from V into W). IfW = K we say that f is a polynomial of 
degree less than or equal to k. If f is a polynomial map of degree less than or 
equal to k, f is uniquely written as a sum fi, with fi 6 P*(V ;IV). The set 
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of polynomial maps of degree less than or equal to k is denoted by Pk(V',W), 
and we put Pk(V) — Pk(V-,K). 

3.3.2. Remarks. There are explicit formulas to calculate (f : K*' —» W from 
a polynomial map f :V —* W (see [CHl, 85, formula 6.3.5]). When A; = 2, we 
are treading familiar ground: the elements / of P^iY) are called quadratic 
forms, and the map ^ such that / = ^ o A is the polar form of /, equal to 

S.S.2.1 4>{x,y) = ^{f(x + y) - f(x) - f(y)) 

for all X,y € y. It was in order to guarantee the uniqueness of 4> that we 
assumed the characteristic of the base field to be zero. For quadratic forms we 
see from 3.3.2.1 that it is enough to have characteristic ^ 2; and in general the 
characteristic just has to be strictly larger than the degree of the polynomials. 

The vector spetce Pi(V) is simply the dual of V , the set of all linear 
forms on V. More generally, PkiV't^) has a natural vector space structure, 
being a set of maps into a vector space. 

The expression “homogeneous of degree fc” comes from the property 
/(Ax) = A*/(i) for any / 6 Pk^) x & V. This property by itself 

is not sufficient to guarantee that / is a polynomial, unless we assume / is 
differentiable enough times (3.7.12). 

3.3.3. Coordinates. Suppose V has finite dimension n, and let {et}i. n 

be a basis for V and x = (Aj,... ,A„) the coordinates of x in this basis. A 
map / € Pk[V-,W) can be written in two ways, both useful. The first is 

/(Ai,...,A„)= ^ .„„A‘'>. .A“", 

ai + - • + «„=* 

where the sum is taken over all integers > 0 which add up to fc, and the 
Ou,. un are fixed vectors in W (scalars if W = K). The second is 

/(Ai,..., An) = ^ ^ A,’, ■ • ■ Aj^, 

«i< - <»k 

where the .are fixed in W, and the sum is taken over sequences of 

(possibly repeated) increasing integers ii,...,ik between 1 and n. In both 
cases we have written multiplication by scalars on the right, so as to recover 
the habitual notation for polynomials when W = K. 

3.3.4. The quickest way to define polynomials over an affine space X is to 
use the associated universal space X. A more elementary definition is given 
in 3.7.11 (cf. 3.3.14). 

3.3.5. Definition. LetX he an affine space, W a vector space. A polynom¬ 
ial map of degree k on X is a map f : X —*W such that there exists f 6 P^{X) 
satisfying f\x = /■ The set of such maps is a vector space in a natural way; 
it is denoted by Pk{X-,W). If W = K , f is called a polynomial of degree 
k, and we write /’fc(X) = Pk(X;W). The symbol of f, denoted by f, is the 
homogeneous polynomial map f = /|j^. 
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3.3.6. Examples. Po(.X^;W'') is just the set of constaoit maps; Pi(X;W) = 
A(X\W), and in particular Pi{X) is the set of affine forms of X (verify this 
directly or use 3.3.14). This justifies calling the elements of P 2 [X) affine 
quadratic forms. 


3.3.7. Definition 3.3.5 is simple and quick, but often useless in practice. 
Besides, one would like to study the restriction map PI^(X) —» Ph(X). It is 
surjective by definition, but is it injective? One also wants to perform explicit 
calculations in finite dimension and study the relation between Pk[X) and 
Pk(X,i), where Xa is the vectorialization of X at a. To do all this, we fix 
a e X, and consider the direct sum X = X ® Ka (cf. 3.1.6); we work in 
Tfc(X) for simplicity. Let $ ■ K such that / = $o A. We have 

3.3.8 

k 

• • • 1 lie + ffco.) = E E n 

1=0 li< <1, .I, 


due to the symmetry of For t = 0,1,..., fc, we define 4>i : X* —+ X ly 



Each <j>i is a symmetric t-linear map, so the map fi — <j>i o A x Xa —* K 
belongs to /j*(X<,), and we have, by 3.3.8, 

fc 

3.3.10 f = 

i=0 

Thus any / € ^fc(X) is a polynomial of degree < k over X^ for any o € X. 
The converse is also true, and 3.3.8 allows us to obtain / from / in a unique 

way. Indeed, if a € X and / = /« with /,• € j’*(X„) for i = 0,1. k 

and fi = 4>i o A with 4>i : X* —* K, we can write 

3.3.11 

* /k\~^ 

^(li+<ia,..., lit+ffca) = ^ . j ^ t«^«(ll,+ a, ■ ■ • ■ li,+“)• 

i=o '*•' li<- -<1, »^lj I, 


Clearly f\x = /, for / = ^ o A, and ^ is symmetric and fc-linear. Moreover, 
for 4 ^ 0, we have 


3.3.12 


k k 

/(I + ta) = E(e+«) = E 

t=0 i=0 

= + “)) = +“))• 
t=s0 


Identifying Xa and X by means of ©a gives 




3.3.13 
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for any x E X, t €: K* and /|jj = f. 

Formula 3.3.13 expresses the fact that / is obtained from / by introducing the 
homogenizing variable t, which in practice means using 3.3.12 for the actual 
calculations. The following proposition is a consequence of the remarks above: 


3.8.14. Proposition. The restriction map 

p;:{x) 3 f ^ fix e Pk{x) 


is bijective; the inverse map is given bp S.S.IS. The elements of Pk{X) can 
be characterized as the maps f : X —* K such that f € Pk(Xa) for any a & X 
(or for some a G: X). Under the identification afforded by the symbol f 
is simply the degree-k part of f. □ 


3.3.15. Example. Let X have finite dimension n and let {xi}j=o.i.n be 

a frame for X, with associated coordinates (Ai),=o.n- Then / g P 2 {X) if 

and only if there exist a,_,, bj,c G K (t, j = 1,...,n) snch that 

/(All... 1 A„) = 'y aijXiXj + ^ ^ biXi + c, 

i.} i 


and then f is given by 

/(•All. •. I An, t) = ^ ^ o^jXiXj + ^ ^ 6* Aft + ct . 

t.y t 


3.4. Barycenters 

Embedding an affine space X into the vector space allows us to form 
linear combinations like X}, AjX,-, where x,- € X and X, G K. A priori, the 
result is just in X, but it will in fact be in X if, in the notation of 3.1.6, 

MfeA.xi) =^A.M(xi) = 5^A< = l 

This gives rise to the following 

3.4.1. Definition. Let {x,}je/ be a family of points in an affine space 

X, and {A,}je/ a family of scalars such that A,- = 0 except for finitely many 
indices and that A^ = 1. The barycenter of the points x,- with masses 

Aj is the point x of X given by 

X = ^A.xi. 
i€I 

3.4.2. Examples. If Ai = 1, Ai = 0 for i / l, then X = Xi. H / = {1,2} 
and Ai = A 2 = I (thus the characteristic of iif is necessarily ^ 2), we find 
X = (xi+X 2)/2; this point is called the midpoint of xj and X 2 . More generally, 
if I — (1,... n} and Aj = 1/n for all 1 < x < n (thus n ^ 0 in K), the point 
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X — (ii H- ■ ■ • + x„ )/n is called the center of mass, or equibarycenter, of the 
Xj. Observe that if X is an affine Euclidean plane, the equibarycenter of a 
triangle {xi,X 2 ,X 3 } (cf. 2.4.7) made up of three points Xi,X 2 ,X 3 coincides 
with the center of mass (in the mechanics sense, that is, the centroid) of a 
triangular homogeneous plate defined by the convex hull of these three points 
(figure 3.4.2.1); but the same is not true any longer if we consider the center 
of mass of four points 

Xl + X2 + X3 + X4 


4 


and the centroid of the homogeneous plate defined by them (cf. figure 3.4.2.2, 
exercise 3.7.14). 




Figure 3.4.2.1 


Figure 3.4.2.2 


Figure 3.4.3 


3.4.3. A FUNDAMENTAL EXAMPLE is when X = R, X is a real affine space 
and X, y are two points of X. The segment with endpoints x,y is the subset 
|x, yj = { Ax+ (1 — A)y | A € (0, ij } of X. This is the foundation of the notion 
of convexity (chapters 11 and 12). 

3.4.4. If S’**" 53.- lie in X \ X. We can still work 

with it, though, if we pull it back to X by dividing it by the scalar Ai, as 
long as this is not zero. 


3.4.5. Definition. An element of X ~ X (X* x X) is called a punctual 
mass, and written as (A,x), where x € X t/ A = 0 and x £ X otherwise. 
A punctual mass (A, x) is also referred to as the point x with mass A. If 

{(Ai, Xi)}^_j,.^ ^ is a family of punctual masses, the barycenter of this family, 

also called the barycenter of the with masses A^, is the point (^^ A, , i) of 
X, where x is the vector ^ X »/ Aj = 0 and the point 


S. Xj;. 

Ei Ai 


eX 


otherwise. 


3.4.6. Remarks. 

3.4.6.1. This definition coincides with the mechanics notion of center of mass 
for punctual masses when Ai / 0. For example, the center of mass of three 




78 


Chapter 3. Barycenters; the univerBal space 


points xj, each with mass 1 , is the point (xi + X 2 + xsj/S, but it must be 
ascribed the mass 3, obtained by adding the masses of the x,-. 

3.4.6.2. H the family consists of two punctual masses (— 1 , j/)} we 

recover the notation yx = x — y; cf. 2.1.4 and 3.1.7. 

3.4.6.3. An equivalent, more classical definition employs the sums Ajj/x^, 
concealing the use of vector fields. The barycenter of a family {(Ai,Xj)}, 

A; / 0 is then the point x satisfying 

3.4.6.4 y^A.^= fy^Ajjyx 

fw all y € Af, and in particular 

3.4.6.5 AjXXj = 0. 

t 

3.4.6.6. The explicit calculation of the barycenter x of a family {(A,- 
(E.A./ 0 ). is performed by vectorializing X at an arbitrary point a € X: 


X = a + 


X).-A, diJ 


In particular, if Af is finite-dimensional and the x,- have coordinates (xji. 

Xj„) in a fixed affine frame, the coordinates of their barycenter in the same 
frame will be 

Ei Ei ^i^in 

E.A, ’•••’ E.A. 

3.4.7. The associativity of addition in X has several consequences for the 
barycenters, giving rise to properties with an intuitive mechanical interpre¬ 
tation and nice geometrical applications. 


3.4.8. Proposition (associativity of barycenters of punctual 
Masses) . Let I be a finite set, / = /j U- • ■ U /,, o partition of I, {(Ai, 

a family of punctual masses. The barycenter of this family coincides with the 

barycenter of uihere, for each I, the punctual mass 

is the barycenter of {(A,, x,)} D 

3.4.9. Proposition (non-associativity of barycenters in X). Let 

I be a finite set, / = /i U • • U /,, o partition of I, {xj}ig/ a family of points in 
X and {A,},g/, families of scalars such that A* = 1 for all I = 1,... ,p. 

Let Qj,..., a,, € if be such that Ej oq = 1- ff \i is the barycenter of the x,- 
with masses Aj (t € ij) and $ is the barycenter of the (1 = l,...,p) with 
masses Qj, then ^ is the barycenter of the x,- (i € 7) with masses o/A; (i € /(). 

□ 


3.4.10. Classical geometrical consequences. We start with three 
points ii,X 2 ,X 3 of X, and put I = {1,2,3} = {1} U (2,3), Ai = 1, A 2 = 
A 3 = ai = |, 02 = I (in particular, the characteristic of if is not 2 or 
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3). Then = x\, and ^2 = {^2 + is the midpoint of {x 2 ,X 3 }. Thus 

the barycenter x of Xi, X 2 , X 3 with masses {l/3,1/3,1/3}, i.e., their center of 
mass, is the barycenter of xi, (x 2 + X 3)/2 with masses 1 and 2 , respectively. 
In geometrical terms, the three medians 

/ X 2 + 2:3 \ / 2:3 + 2:1 \ / 2:1 + X 2 \ 

must be concurrent; their common point | is the barycenter of the triangle 
{xi,X 2 ,X 3 }, and it divides each median in segments whose ratio is 


XI 



Considering now four points (again char/f 2,3), we obtain seven con¬ 
current lines: three joining the midpoints of opposite sides (including diag¬ 
onals), and four joining a vertex to the center of mass of the other three 
(figure 3.4.10.2). 

3.5. Relationship with affine maps and subspaces 

3.5.1. Proposition. Let X,X' be two affine spaces. A necessary and 
sufficient condition for a map f : X —* X' to be affine is that f preserve 
barycenters, i.e., that for all finite families {x^j^g; C X and {A,},g/ C K 
with 52^ A, = 1 we have 

Proof. Necessity follows from 3.2.1 and 3.4.1. To prove sufficiency, we have to 
show that / : > AT} (a) is a linear map. But if x, x' € Af and A,p 6 if, the 

point x" = Ax + px' in is none other than the barycenter of {(a, 1 — A —p), 
(x, A), (x',/i)}, and similarly for f[a), /(x) and f(x'). □ 

The next proposition can be shown by the same argument: 

3.5.2. Proposition. Let X be an affine space. A subset S C X is a 
subspace of X if and only if it contains the barycenter of any family of points in 



80 


Chapter 3. Barycenters; the universal space 



Fig 3.4.10,2 




Figure 3.5.3 


S, i.e., if for all finite families {A,},gj C K and C S with A, = 1 

we have 

XiXi e 5. □ 

i 

3.5.8. Corollary. If S is a subset of X, the subspace {S) spanned by X 
is the set of barycenters of points of S (for all possible choices of masses). □ 
Thus the line D = (x, y) spanned by two distinct points i, y on it is simply 

= (a:, J/> = { Ax + (1 - A)y I A e if }. 
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3.5.4. Proposition. Let X be an affine space over a field of characteristic 
zero. For any finite subset F C X there exists a point x & X such that 
GAfIA") C GAj;(X). In other words, any f € GA(X) such that f{F) = F 
fixes X. 

Proof, li F = take x = (xi H-1- x„)/n and apply 3.5.1. □ 

This can be extended to the case of F compact: see 2.7.5.7 and 9.8.6. Exercise 
3.7.3 provides a nice corollary. 

3.6. Barycentric coordinates 

3.6.1. We will exploit here the idea, introduced in 3.2.5, of associating to 

an affine frame {x,},-,=o.i „ of the affine space X the corresponding basis 

()_i.„ of X. 

3.6.2. Proposition. Let {x,}<=o.i.n be a frame for an affine space X. 

For any x & X there exist A, € if (t = 0,1,..., n) such that = 1 

X = J^jAjXi. The scalars A, are uniquely defined by this property and are 
called the barycentric coordinates of x in the frame {x,}i=o j. 

Proof. As an element of X, x can be written in a unique way as A^x,. By 

3.1.6, 5^, Aj = 1 because x X C X. □ 



3.6.3. For example, if n = 2, the points with coordinate Aq = 0 in the 
frame {x(),X],X 2 } are those which belong to the side (xi,X 2 ) of the triangle 
{xo, xi, X 2 }. (See figure 3.6.3 for the case if = R; the shaded part corresponds 
to points with coordinates Ai > 0 for f = 0,1,2.) This remark will be relevant 
to the study of convexity, see 11.1.8.4. 

3.6.4. Barycentric coordinates will be used in 10.6.8. 
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S.6.5. BarYCENTRIC subdivision. The barycentric subdivision of a sim¬ 
plex {xq,. .-iXj} in a real affine space of dimension d is defined by induc¬ 
tion on d. For d = 1 , it is just the set consisting of the two simplices 
{xo,(xo + Xj)/2} and {(xo + Xi)/2, X|}. For d = 2, it is the set of six 
triangles having one vertex at (xo -f xi ■+ X 2)/3 and opposite side equal to 
each simplex in the barycentric subdivisions of the faces of {ii, X 2 , X 3 } (figure 
3. 6 .5.1). The generalization is obvious. 




xi 

-o 


2 


xo 



Figure 3. 6 .5.2 shows the barycentric subdivision of the barycentric sub¬ 
division (i.e., the second barycentric subdivision) of consists 

of 6 X 6 = 36 triangles. See exercise 3.7.8 about iterated subdivbions. 



Figure 3.6.5.2 


This application of barycenters is unportant in algebraic topology: see, 
for example, [GG2, 60-68], |H-Y, 206-209], or jCA, 82-86]. 
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3.7. Exercises 

3.7.1. Is Proposition 3.5.4 still valid in prime characteristic? 

3.7.2. Extend the geometric results in 3.4.10 to sets of five and six points. 

3.7.3. Let X be an affine space over a field of characteristic zero. Show that 
every finite subgroup of GA(Jf) has a fixed point. 

3.7.4. Let X be a finite-dimensional real affine space, K a compact subset 
of X with non-empty interior. Suppose K — Ur=i where ^ 0 and 
Ki n Kj has measure zero for all i ^ j. Put = cent'(iifj), ^ = cent*(X) 
(cf. 2.7.5.2). Prove that ^ is the barycenter of the with masses fJt(Ki). 



3.7.5. Let X be an affine space. Give an intrinsic definition (i.e., one that 
uses neither a vectorialization X„ nor the universal space J?) for Pi (X) and 
p 2 (X). Find a formula for the symbol /*of / € Pi(X) and of / e P 2 (X^)' 

3.7.6. Why did we take the vector field x >--♦ fci2, instead of x >-* kax, in 
definition 3.1.2.2? 

3.7.7. Let X be an affine space, a,b & X, f & Pk{X). Put / = Ef=o /• ™ 

X,j, with fi e Fi’(X„), and / = in with g, G P*[X,,). Fixing an 

affine frame for X, write an explicit expression for the g,- as a function of the 

/, and of ^ = ofc, first for fc = 1, then for k — 2. 

* 3.7.8. Let E be a simplex in a Euclidean affine space of dimension n; let d be 
its diameter (cf. 0.3). Show that all simplices of the barycentric subdivision 
of E have diameter less than or equal to nd/{n + 1); deduce that when we 
iterate the process of barycentric subdivision, the diameter of all simplices 
tends towards zero. 

3.7.9. Generalize formula 3.3.2.1 for k = 3. 

3.7.10. Let dimE = n. Show that, for all fc > 0, 

/n + k — 1\ /n + k\ 

dim P:{E)=i^ k j, dimF,(E)=(^ k j- 

* 3.7.11. Give a direct proof of the fact that if / : X —► W belongs to 
Pk(Xa', W) for some o G X, then / G Pfc(X(,; W) for any i G X. 



84 


Chapter 3. BarycenteTs; the universal space 


3.7.12. Let X be a real vector space and / : X —» R a map such that 
/(Ax) = A*/(x) for all X 6 X and A 6 R. Show that the derivative /' of 
/ satisfies the Euler identity /'(i)(i) = kf{x) for any x e X. Write and 
prove an analogous formula for the p-th derivative of /, when / is of class 
C*' and again homogeneous of degree k. Deduce that if / is of class and 
homogeneous of degree k, it is necessarily a polynomial. 

* 3.7.13. Determine the center of mass of the physical object consisting of 
three homogeneous pieces of wire of same linear density, lying on the three 
sides of a triangle. Give a geometrical construction for this point. 

* 3.7.14. Give a geometrical construction for the centroid of a homogeneous 
plate in the shape of a quadrilateral. Compare this point with the center of 
mass of the four vertices (figure 3.4.2.2). 

* 3.7.IS. Using the notation of 3.3.1, show that for every / e 
have 

y=i ■■ 

* 3.7.16. In a real affine space, consider p points xi.i,..., Xi ,, (p > 2). For 

i =1, 2, ..., p denote by ij., the center of mass of the (*i.j)j>s.. Define by 
recurrence the center of mass Xfc+i.i of the for all A: > 1. Prove 

that every sequence (x^.j-j^gN converges. What can you say about the limit 
of these sequences for different values of t? 

3.7.17. Prove the theorems of Ceva and Menelaus (2.8.1 and 2.8.2) using 
barycenters. 

3.7.18. Show that the midpoints of the diagonals of a complete quadrilateral 
(see figure 3.7.18) are colliiiear. Compare with 6.4.4 and 17,16.2.1. 



Figure 3.7,18. 


Chapter 4 

Projective spaces 


This chapter is devoted to projective spaces. It starts with 
an introduction (section 4.0), meant to lessen the shock of a 
somewhat sudden definition in 4.1. Section 4.1 also contains a 
number of examples. The aim of the rather long sections 4.2 
and 4.3 is to give the reader a concrete idea of what projec¬ 
tive spaces are, and to show that projective spaces now play 
a role that transcends the framework of elementary geometry, 
where they originated. Section 4.2 describes natural coordinate 
systems; section 4.3 studies finite-dimensional real and complex 
projective spaces, including their topological properties. Keep¬ 
ing in mind the relatively elementary level of this book, we have 
not included proofs of results from algebraic topology, but they 
should, together with the references mentioned, be enough to 
convince the reader of the continuing importance of projective 
spaces in mathematics (though the focus of interest has changed 
somewhat over time). 

The following sections present a classical, and consequently 
terse, exposition of projective morphisms and subspaces, to¬ 
gether with their elementary properties. In the last section (4.8) 
we say some words about the non-commutative case. 
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Base fields are assumed commutative, unless we state otherwise. 


4.0. Introduction 

Here are some notes to motivate the introduction of projective geometry 
and show that definition 4.1.1 doesn’t just come out of the blue. 

4.0.1. Affine geometry is not satisfactory under certain points of view: the 
intersection of subspaces, for example, gives rise to numerous exceptions, as 
attested by the statements of the results in 2.4.2.4, 2.4.9.2 and 2.4.9.4. One 
would like to have a geometry where the intersection of two subspaces b 
always a subspace, and where the relation dim(5 U T) = dim 5 + dimT — 
dim(5 n T) is always satisfied. 

4.0.2. Desargues was the first (and led the field by about two centuries) 
to construct the projective completion of a real affine plane by adding to 
this plane points ‘at infinity”, one for each set of parallel lines in the plane 
(in the completion, these lines all pass through the point they define). Thb 
construction will be the object of chapter 5. 



4.0.3. It b natural in geometry to consider the set of lines passing through a 
fixed point, or, alternatively, the set Gea of one-dimensional vector subspaces 
of a given vector space E. Such a set appears, for example, in the study of the 
tangent to a curve at a point (cf. 4.3.4). The definition of projective spaces 
given here (4.1.1) uses precisely these Gb.i- 

4.0.4. Our eyesight b not plane, but conical, as all rays converge to the center 
of the eye. It also poses the problem of combining two images obtained from 
different centers; this is connected with the notion of perspective, and plays 
a role in aerial photography, for example (see section 4.7). 

4.0.5. Projective spaces provide a nice and historically relevant way of trans¬ 
lating linear algebraic concepts, by associating a quadric to a quadratic form, 
for instance (see 14.1). Further, real and complex projective spaces play a 
fundamental role in differential geometry, algebraic topology (cobordism, for 
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instance), and, sure enough, in algebraic geometry. Part of their interest is 
due to their being the simplest compact manifolds, except for spheres. For a 
modern discussion of these subjects, see for example [GG2], [HU], |B-H|. 

4.0.6. Projective spaces arise naturally in quantum mechanics, cf. |C-D-L, 
219], for instance. 

4.1. Definition and examples 

4.1.1. Definition. Let E be a vector space. The projective space derived 

from E, denoted by P(E), is die quotient of E\0 by the equivalence relation 
“x ^ y if and only if y = Xx for some X & K”. The dimension of P(E) is 
dim E — 1. The canonical projection is p : E\0 P(E). 

4.1.2. Remarks. The definition of the dimension will be justified in 4.2.1 
and 5.1.3, but it is quite natural, since P{E), being a set of lines of E, should 
have dimension one lower than E. 

Definition 4.1.1 can bother the purists, since it hitches up to a projective 
space the vector space from which it derives. A possibly more satisfying, but 
also lengthier, definition is found in [B15, 337); see also 4.8.3. 

There are axiomatizations of projective geometry, as for affine geometry; 
see references in 2.6.7. 

4.1.3. Examples. 

4.1.3.1. For every integer n > 1 we set P”(/f) = and call this the 

standard projective space of dimension n over the field E. 

4.1.3.2. A projective space is called real if E = R, and complex if E = C. 

4.1.3.3. Zero-dimensional projective spaces are called points; one- and two- 
dimensional projective spaces are called projective lines and planes, respec¬ 
tively. 

4.1.3.4. There is a natural bijection between the projective space P{E) and 
Gb.i (cf. 1.2.5); the two spaces are often identified. 

4.1.3.5. By 2.4.8.1 and 2.4.8.6, the hyperplanes of a vector space E are in 
one-to-one correspondence with the points of P(E*), where E' is the dual of 
E. We denote by K (E) the set of hyperplanes of E\ thus there is a bijection 
IC(E) t-* P(E*), which serves to identify the two spaces. 

4.1.3.6. Let Pk(X) be the vector space of polynomials of degree k over an 
affine space X (3.3.5), and let N be the map 

N : Pk[X) 3 f ^ r^O) C X. 

Since (A/)“*(0) = /~^(0) for all A € E*, the map N can be factored by the 
equivalence relation ~ of 4.1.1, giving rise to a map jV from P[Pk(X)) to the 
set of subsets of X: 

Pk(X)\0 P{Pk(X)) 

N \ / 

subsets of X 
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We shall see that in certain cases JV is injective (see 14.1.6). For A; = 2, the 
image of jV is, by definition, the set of quadrics of X. 

Similarly, if K is a vector space, by remarking that /(Ax) = A*'/(x) for 
/ € P^(V) (see 3.3.2) we are led to consider the diagram 

(i. |g 

subsets of V —> subsets of P(F) 

since each /“*^(0) is, in fact, a cone in V. We shall meet these two diagrams 
again in chapters 14 and 15. 

4.1.3.7. Let if be a finite field with k elements, and P(E) a projective 
space of dimension n over K. Then =f^P{E) = (fc”'*'^ — l)/(A: — 1) because 
_ ^n+i each line has fc —1 elements, after deleting the origin. For 
example, every projective line over Z 2 has three points, and every projective 
plane seven points. 

4.2. Structure of projective spaces: charts 


4 . 2 . 1 . Let P{E) be a projective space. If if C P is a hyperplane, one can say, 
in the notation of 2.2.5, that Efj = Ge.i \Gjf.i = P(E) \P(H) is a subset of 
P{E), and constitutes, in fact, most of P(E). But Ejj C P(E) has a natural 
affine structure, so that, except for the points of P{H), the points of P{E) 
form an affine space. This makes it much easier to perform calculations, use 
frames, and so on. We call the bijection P(E) \ P[H) —» Ejj a chart for 
P(E). 

In order to parametrize the whole of P{E), each point must be in the 
domain of a chart. Suppose that P{E) has dimension n, and take a family 
{ift}t=() n of hyperplanes of E such that Pit i^i = 0. Then Hi = 0, 
so P{E) = \J,{P(E) \ F(if,)) is indeed the union of the domains of the 
corresponding charts. We say we have an atlas for P[E). 

4.2.2. In order to be able to use an atlas, we must know how to pass from 
one chart to the other. In other words, we must calculate the broken arrow 
in the diagram below; 

P(E)\{P(Hi)uP{H,)) 

y \ 

Eh, -* Eh, 

Here i ^ j and the broken arrow indicates that the map is not defined on the 
whole of Eh, ■ We shall work out this calculation in 4.2.4. 

4.2.3. A simpler idea for calculating in P{E) when dim£' < 00 is to take 

a basis {e,}j=o.i n of E. Every m € P(E) is of the form m = p(x) = 

p(io,..., In), where x = (xo,...,2:n) in the basis being considered. We say 
that (xq, Xi,..., Xn) is a set of homogeneous coordinates of x (with respect 
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to the basis {ei}i=o.i.n)- The word “homogeneous” comes from the fact 

that the sets of homogeneous coordinates of a point x € P(E) are all of the 
form (Axo,..., Ax„), for all A € K* and fixed xo,..., x„. 


4.2.4. We now blend the two viewpoints presented in 4.2.2 and 4.2.3. Take a 

basis {ei}i=o.i.n of E, and define the hyperpanes Hi as Hi = x“’(0). The 

points of P(E) \ P{Hi) are those whose homogeneous coordinates (xq, Xi,..., 
x„) satisfy ij ^ 0. On the other hand, by 2.2.7, Eh, is isomorphic to the affine 
hyperplane xj”^(l) = Hi + e,, parallel to Hi, and which has an affine frame 
{e,}u{e, + ey}y^,. In this frame we can express the chart P[E)\P[Hi) — * Eh, 
as the bijection tt, : P(E) \ P{Hi) — * defined by 


4 


.2.4.1 

TTi ; P{E) \ P(Hi) B p(io,xi,...,x„) 



e 



The calculation of xy o * is now immediate. We have, for j > i, 
4. 2. 4. 2 7r"'(vi,...,w„) = p(ui, ..., Vi-i , 1, v,,..., v„). 


and 
4.2.4.3 


. , f 'Jj-l \ 

^ > • • • 3 “^n ) ( >'’’3 J > 3"*'3 3***3 ll 

\Vj-i Vj^i Uj-i wy-i/ 

where xy o x”^ is defined on ff” \ iri[P{Hj)) = v~*(0). 


4.2.5. Example: E = K^. This is an important case because P(K^) = 
P^K) is the simplest non-trivial projective space over K, the projective line. 
We naturally take {eo, ej} to be the canonical basis of E^] thus 4.2.4.2 reduces 
to the map K* B v l/v G K*. 

One way to interpret this is by saying that P^[K) is obtained by gluing 
together two copies of K by the map u *-* 1/w on if*. Don’t be surprised to 
recognize this as one definition of the Riemann sphere (for K = C^); we will 
meet it again several times, in 4.3.6, 10.8, 16.3.9 and 20.6. 
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Another interpretation is that P^(K) is the union of K, embedded in 
F* [K) via the map tTq * : w —> p(t;, 1), with the point p(l,0); from this point 
of view, P^{K) is the completion of K, obtained by adjoining one point at 
infinity, here p(l,0). We shall discuss this point of view in more detail in 
5.2.3. 

4.2.6. Notes. The maps jTj o7r~* ; t;”*(0) — » belong to the most regular 
class of maps, after linear and afhne maps; they are rational functions. If 
if = R or C they are continuous, C°° (which makes sense since they are 
defined on an open set of if”), and even , i.e., real or complex analytic, 
as the case may be. Thus all kinds of topological, differential and analytic 
notions make sense on real and complex projective spaces. In the language of 
manifolds, this means that real and complex projective spaces are topological, 
C°° and even manifolds. 

Finite-dimensional complex projective spaces are the natural habitat of 
algebraic geometry. Interesting references are |TM, 190-191], |DE5| and |GR- 
HA2]. The topology of real and complex projective spaces is taken up again 
in the next section. Their orientability is treated in 4.9.4. 


4.3. Structure of projective spaces: topology and 
algebraic topology 


In this section we deal only with finite-dimensional real or complex projec¬ 
tive spaces. 


A finite-dimensional vector space E over R or C has a canonical topology 
(cf. 2.7.1). This gives rise to a canonical topology on P[E): 

4.3.1. Definition. The canonical topology of a pTojective space P(E) is 

the quotient of the canonical topology of E \ 0 by the equivalence relation 
~ of Projective spaces will always be considered with their canonical 

topology. 

4.3.2. Lemma. If H is a hyperplane of E, the bijection Eh —* P{E) \P(H) 
(cf. 2.2.5) is a homeomorphism from Eh (cf. 2.7.1.1) into P[E) \P{H) with 
the induced topology. 

Proof. In coordinates, this bijection is the map given in 4.2.4.2, which is 
obviously continuous. Its inverse 

7r„ : p{vi,...,Vn,l) (vi,...,v„) 

is also continuous, by the properties of the quotient topology. □ 

Thus the canonical topology of P{E) is the same topology it has as a manifold 
(cf. 4.2.6). 
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4.3.3. Proposition. The space P[E) is Hausdorff, path-connected and 
compact. 

4.3.3.1. From the geometric point of view, the first property is a consequence 
of 4.3.2 and the fact that affine spaces are Hausdorff, since for any two points 
m, n e P{E), there exists a hyperplane H of E such that m, n 6 P{E)\P(H) 
(figure 4.3.3.1). 



A more algebraic proof goes like this: Let A^E be the second exterior 
power of E, and introduce the map 

a : (E\0) X [E \ 0) 3 (x, y) >-* X A y e k^E. 

This is a bilinear, hence continuous map, and its kernel a“^(0) is exactly the 
graph of the equivalence relation Since ci(“^(0) k the inverse image of 0 
by a continuous map, it is a closed set, so the quotient P(E) = (E\0)/ ~ k 
indeed Hausdorff. 

If you don’t know any exterior algebra, you can skirt the more esoteric 
notation in the previous paragraph and define a in coordinates with respect 
to a fixed bask of E by 

tt((2:i,--.,x„),(yi,...,y„)) = (xiVz - 2:21/1, •••, x„_iy„ -x„y„-i)- 

Here the image of a lies in (which k isomorphic to k^E), and each 

component is of the form x, yy — Xj-y,-. 

4.3.3.2. To prove the remaining two properties, look at £ as a vector space 
over R (whether K = R or C), and consider on E any Euclidean structure. 
Let S[E) = {x € E I ||x|| = 1} be the unit sphere in E. Since p(x) = p(x/||x||) 
for any x € E \ 0, we have 

p(5(E))=p(E). 

Now S{E) k compact (18.2.1) and (unless dim E = 1) path-connected, so 
P(E) is compact (being Hausdorff) and path-connected (since P(E) has only 
one point when dim E = 1). One can also work as in the geometrical proof 
of Hausdorffness. □ 
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S{E) ^ E\0 

l> \i 1/ p 

P[E) 


4.3.3.3. Remark. This proof also shows that P{E) is homeomorphic to the 
topological space S{E)/ where a: ~ y if and only ify = ±iforff = R and 
y = ux (u e U = {2 e C I |z| = 1}) for K = C. 


4.3.4. Application. Since P(X) is identified with the space of lines going 
through a fixed point a of the affine space X, we see from 4.3.3 that the set 
of secants joining a to other points of a continuous curve C going trough a 
has at least one accumulation point (figure 4.3.4). This is a good candidate 
for the tangent to C at a. 



Figure 4.3.4 


4.8.5. Note. One can define a topology on P(E) for base fields other than 
R and C, for example, when if is a locally compact field. In this case 4.3.3 
is still valid. 

4.3.6. Proposition. Tke space P’{R) is homeomorphic to the sphere S^, 
and P^[C) is homeomorphic to the sphere . 

4.3.6.1. Identify R^ with C, so that — 5(R^) = 5(C); we have P*(R) = 

p(5^). Take the squaring map c : 5* 9 2 2 ^ £ 5*, which is well-defined 

since the square of a complex number of absolute value 1 also has absolute 
value 1. Since p(—z) = p(z) and [—z)^ = , we see that c can be factored 

as c o p, and c : P^(R) —* 5^ is still continuous. It is also bijective and 5* is 
compact (cf. 4.3.3), so c is homeomorphism. 

5‘ P‘(R.) 

C ^ 1 /^ c 

s' 

4.3.6.2. Identify with R^ endowed with its canonical Euclidean structure, 

i.e., j|(2,2')||^ = jzp -f jz'p for (z,z') € = R^. Consider the sphere 

= 5(R^) = 5(C^) and the sphere = 5(R®), where R^ is identified 
with C X R. As in 4.3.3.2, we have 

p(5®) = P'[C) = P{C^). 
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Introduce the magic map H : defined by 

4.S.6.3 H{(z, z')) = ( 222 *, \z\^ - \z'f) 

(we leave it to the reader to verify that {z,z') 6 5® indeed maps inside 5^). 
Now for m, m' € we have p(m) = p(»w*) if and only if m' = Am for some 
A 6 C, and we then must have |A| = 1. But H[[Xz, Xz')) = H[(z,z')) for 
|A| = 1, so ff can be factored as ^op, and H : P*(C) —> 5^ is easily seen to 
be continuous, bijective, and thus a homeomorphism since P*(C) is compact. 

53 _?L, pi(C) 

H \ 

52 

4 . 3 . 7 . Note. The map H : is called the Hopf Sbmtion, and is 

fundamental in geometry. The inverse images ff~*(s) of points s € 5^ are all 
homeomorphic to the circle 5^; this result, a consequence of 4.3.6.2, has al¬ 
ready been mentioned in 1.2.9. These circles in 5® form a structure connected 
with Clifford parallelism, a topic dealt with in detail in 18.8. In addition, the 
map P : 52 —» 52 and its generalizations are essential in algebraic topology; 
see for example [GG2, 15lj, (H-W, 387] and |HU, chapter 14|. 


4.3.8. The heuristics of the Hopf FIBRATION. As in 4.2.5 and 5.2.3, 
we write P^(C) = C U oo, where oo is the point p(l,0) and C the set of 
p(a:, 1). We suspect that C U oo = R2 u oo, the one-point compactification 
of C, is homeomorphic to S^. The confirmation comes from the map known 
as stereographic projection (figure 4.3.8.1; for details see 18.1.4), which here 
can be written as 


4.3.8.1 


C 9 X I—* 




M" + i/ 


e 5" 


where we still have C = C x R. 



Figure 4.3.8.1 
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In homogeneous coordinates we have 

z = p(^,l) =p(u,t;), 

which leads to defining z = u/w. If we replace z by u/w in 4.3.8.1, keeping in 
mind that |up + |wp = 1 for (u, w) € 5®, we get exactly 4.3.6.3. 

4.3.9. The topological characterization of P"(R) and P"(C) for n > 2 is 
more difficult, and lies in the realm of algebraic topology. Here we limit 
ourselves to talking about some particular cases and giving references. 

4.3.9.1. The first special case is P^(R). This space is non-orientable (cf. 4.9.4 
and 4.9.5), so it cannot be embedded in R® (without self-intersections or 
singularities). This means there is no hope of visualizing P^(R) as a subset 
of R^. There are immersions of P^(R) in R^; the images of two of them are 
given in figures 4.3.9.1.1 and 4.3.9.1.2 (see |H-C, 313-319], for details of these 
immersions). The surface on the right is called Boy’s surface. 



There are embeddings of P^(R) in R” for n > 4; a particularly beautiful 
example in R^ is the so-called Veronese surface, which arises from the map 

R^ 3 (x, y, z) -» [x^,y^, z^, y/2yz, \/2zx, y/2xy) e R®. 

The restriction of this map to has values on the (five-dimensional) affine 
hyperplane Uj = 1 of R®. It is an even map, and the quotient P^(R) —» 
R® is an injection, hence a komeoniorphism into its image. 

It is also possible to see P^(R) as a Mobius strip whose boundary, a 
single curve F, has been collapsed to a point, or, better yet, identified with 
the boundary of a disc F' (figure 4.3.9.1.3). 
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4.3.9.2. The projective space P^(Il.) is homeomorphic to the group 0'^(3) 
of rotations of R^, cf. 8.9.3. 

4.3.9.3. The spaces P"(R) are never simply connected; in fact, their fun¬ 
damental group is isomorphic to Z 2 for n > 2, since by 4.3.3.3 the map 
p : S" —» P"(R) is a twofold covering, and 5" is simply connected (18.2.2). 

4.3.9.4. The spaces P"(C) are always simply connected. They are not 
homeomorphic to 5^" unless n = 1. Real and complex projective spaces 
can be fully described from the algebraic topological point of view, via their 
cell decompositions, Betti numbers, cohomology rings over Z and so on. See 
[GG2, 90], jSR, 264-265). The ring structures of the Z 2 cohomology of P"(R) 
and the Z cohomology of P"(C) are particularly simple: they have a single 
generator, of degree one for P"(R) and degree two from P'*(C). 


4.4. Projective bases 


A coordinate system for an n-dimensional vector space requires n points 
(a basis); for an n-diniensional affine space, n-|-1 points (an affine frame); and 
we shall see that a coordinate system for an n-dimensional projective space 
requires n-|-2 points. One reason is the following: in the notation of 4.2.3, the 
n + 1 points m,- == p(e.) (i = 0,1,..., n) of P[E) are not enough to determine 
the homogeneous coordinates of a point (even up to a multiplicative scalar), 
since the rrii could equally well come from AjC,, for arbitrary A* € K*, not 
necessarily the same for all t. However, adding a (p + 2)-th element resolves 
the ambiguity: 

4.4.1. Definition. Let P(E) be an n-dimensional projective space. A 

(projective) base for P[E) is a family {n,}r=o.i.n +2 0 / n + 2 points of 

P{E) such that there exists a basis . of E satisfying mi = p(e,) 

for i = 1,..., n + 1 and mo = p(ei . -I- e„+i). 

This last conditions says that, in the system of homogeneous coordinates 
defined by the basis the point mo can be written as (l,..., 1). 

The next result, though elementary, is fundamental for what follows: 

4.4.2. Lemma. Let mi be a projective base for P(E). Two bases {e^}, 
{e(} satisfying definition 4-4-i necessarily proportional, i.e., there exists 
A € K* suck that ej = Aej for all i = 1,.. .,n ■+ 1. 
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Proof. Geometrically, this is a consequence of figure 4.4.2 and Thales’ theo- 
rem (cf. 2.5.1). Alternatively, we can use the assumptions p(e,) = p(e[) and 

p(ej +-h «'„ + 1) = p(ei + + en+i) to find X,Xi e K (t = 1,..., n + l) 

such that e|. = AiCj and 

+ ■ ■ + e'^ + 1 — A(ei + ■ ■ ■ + e„+i). 

Then 

Aej 4- • ■ + A„+i = Ajci + ■ ■ ■ + A„+ie„ + i, 
which implies A = A, for all t = 1,..., n + 1 because {e, } is a basis. □ 

4.4.3. Thus a projective base of P(E) is enough to associate homogeneous 
coordinates to points of P{E), since it determines a basis for E, well-defined 
up to a multiplicative scalar. These homogeneous coordinates are called the 
projective coordinates of the point with respect to the projective base. 

4.5. Morphisms 

4.5.1. Let E,E' be vector spaces, and let f 6 L[E\E'). We have f[Xx) — 
A/(i) for all X € E, so f is compatible with the equivalence relations defining 
P(E) and P{E'). There is, however, a substantial nuisance; the image f{E\0) 
is generally not contained in jE' \ 0, so the quotient map can only be defined 
from P(E) \ P(/“*(0)) to P[E’). We shall nonetheless talk about such 
quotient maps as if they were maps from P(E) —* P[E'). 

4.5.2. Definition. Let P(E), P(E') be projective spaces. A map g : 
P(E) —» P(E') is called a morphism if there exists a map f : L{E\ E') such 
that g is obtained from f by passing to the quotients, i.e., gop = p° f (in 
particular, g is generally defined on P(E) \ P(/~*(0)) only). In this case we 
put g = f_- We say that f is an isomorphism or a homography if f is a vector 
space isomorphism; homographies are really defined on the whole of P[E). 
The set of morphisms from P(E) to P[E') is denoted by M[P(E)-, P(E')), 
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and the set of isomorphisms by Isom(P(.E); P(£*)) . 


E\f-'(0) 

E'\0 


f 1" 

p(E)\p(r»(o)) 

P(E') 


4.5.3. Question. The first question is how good is the correspondence 
f ^ fj, the answer is that, for /, /' £ L(E-, E'), we have 

4.5.4 f = f' <==> f' = A/ for some A € K*. 

Proof. Observe first that g = f determines the kernel /“*(0). Now if a:, j/ 
are linearly independent vectors not in /“*(0), then /(x) and f(y) are also 
linearly independent. But f[p(x)) = /'(p(i)), and similarly for x + y and y, 
so there exist scalars A(x), X(y), A(x + y) such that 

/'(x) = A(x)/(i), /'(y) = A(y)/(y), /'(x + y) = A(x + y)/(x + y). 

Since f(x) and /(y) are linearly independent we conclude that A(x) = A(y) = 
A(x + y), showing that /' = A/ for some A. Q 

Property 4.5.4 can be interpreted as providing a bijection 

4.5.5 M*{P(E)-P(E'))^ P{L(E-,E')), 

where M*[P[E)-, P(E')) represents M{P{E);P(E')) without the trivial mor¬ 
phism arising from the zero map from E into E' (which is, by the way, the 
only morphism defined nowhere in P(.E)!) 

4.5.6. Remark. Commutativity of the base field is essential in proving the 
backward implication in 4.5.4. If the field is not commutative, the map A/ 
obtained from / € L[E\ E') is generally no longer linear. 

4.5.7. Bearing in mind the caveat in 4.5.1, we have, for / £ L{E',E') and 
g&L(E'-E"): 

9° f = 9° f- 

4.5.8. Every n-dimensional projective space over K is isomorphic to P'^{K). 
This, together with continuity, explains why in section 4.3 we only talked 
about P"(R) and F"(C). 

The following is a consequence of 4.5.5 and 4.5.7: 

4.5.9. Proposition. The homographies P(E) — ► P[E) form a group under 
composition of maps, called the projective group of E, and denoted by GP(E) 
or PGL(jE). There is a group isomorphism GP{E) = GL[E)/E* Me. 

4.5.10. Proposition (first fundamental theorem of projective 
GEOMETRY ). Let P{E), P(E') be projective spaces of same finite dimension, 
and {mj}, {m'} projective bases for P(E) and P(E'), respectively. There 
exists a unique homography g : P{E) —* P[E') such that m' = y(mi) for 
every i. 
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Proof. Let {ci}, {e'} be bases associated with {rwi}, {mj} as in 4.4.1. Define 
/ e Isom(£; E') by e' = f[^i) for all t; then f_ = g has the desired properties. 
If g and g' both have these properties, the homography g'~^ o g ; P{E) —» 
P{E) fixes the base {m, } pointwise, and must be of the form A Id^ by lemma 
4.4.2. □ 

4.5.10.1. Remark. This result is false if the field is non-commutative; see 
exercbe 4.9.8. 

4.5.11. Corollary. The group GP{E) acts transitively on P[E) and sim¬ 
ply transitively on bases of P(E). □ 

4.5.12. The name of the fundamental theorem of projective geometry for the 
apparently simple proposition 4.5.10 is justified by the difficulty in proving it 
from axiomatizations of projective spaces. See, for example, [V-Y, volume I, 
p. 95). 

4.5.13. Explicit calculation of MORPHISMS. Let {e,} and {e'} be 
bases for E and E', respectively. Consider / G L(E\E'), and let M(/) = (aij) 
be the matrix of / with respect to the bases {e,} and {e|}. In terms of the 
associated homogeneous coordinates in P(E), P(E') (cf. 4.2.3), we have 

• • • ) ^u+i)) = ( • • • ! “n+l.t 

^ « i 

Homogeneous coordinates are only defined up to a multiplicative constant; 
if we want a completely determined formula, we can use the charts in P(E) 
and P(E') associated with the respective bases. Take, for example, the charts 
+ 1, where n = dim(P(iJ)), m = dim(P(E')). Using 4.2.4.1 and 
4.2.4.2 we get the following formula for o f o 

‘’^rn+ 1 ° / ° ^n +1 ((*^1 1 ■ • • ) *'ri)) 

Si 

Naturally, this is only defined where the denominator is non-zero. 

4.5.14. Particular case: n = m = l. Take / e GP(iif2), that is 

/ G GL(/f^), and let the matrix of / in the canonical basis of (cf. 4.2.5) 
be M(/) = Then ?r2 o / o takes the form 

av -h 6 

V —♦ - - 

cv a 

for V ^ —d/c, or, in homogeneous coordinates. 






4.5. MorphismB 


99 


for t; ^ —d/c. Doing the same thing for other combinations of the two charts 
^2i one obtains the following table; 


4.5.15 



Observe that, for = R or C, 


a av -h b , 00+6 

- = lim -; and lim -- = oo, 

c »j—oo CD + a ti—-li/c cv + d 

thus corroborating the observation made at the end of 4.2.5, that p(l, 0) plays 
the role of infinity. We will return to this point in 5.2.5. 

4.5.16. Structure of HOMOGRAPHIES. Again we assume that P{E) is 
finite-dimensional. In view of 4.5.4, our objective is to study the structure of 
a map / e GL(.E) up to a multiplicative constant. The geometric structure of 
an element of GL(£) is described, in the algebraically closed case at least, by 
its Jordan decomposition; passing to P(.E) poses no special difficulties. We 
do not pursue this question here, but refer the reader to |FL, appendix l]. It 
is interesting to compare the modern treatment in this reference with an older 
exposition not using linear algebra explicitly, for example jV-Y, volume I). 

4.5.17. Proposition. Let g - f_e GP(.E) and m e P(E). Then m is a 
fixed point of g (i.e., m = g(m)) if and only if p“‘(m) ts an eigenspace of f. 

□ 

The conclusion is that the essential things in the study of morphisms / of 
P(E) are the eigenspaces of / in E, and not its eigenvalues (though these too 
merit interest, cf. 6.6.3). 

4.5.18. Corollary. If K = C, or if K = 'R. and the dimension of P(E) is 

even, every homography has at least one fixed point. □ 

For example, if / £ GL(£') has no repeated eigenvalues, / has exactly n + 1 
distinct fixed points, where n = dim(P(£^)) (exactly because of 4.4.2). In 
older books such points are called “double*. 

4.5.19. Involutions of GP(£;). If / e GL(fr) is involutive (i.e., p = 
/ o / = Id£;), so is / (in fact, we shall see in 6.4.6 a dazzling geometric 
construction for / in this case). Watch out, however: not every involutive 
g 6 GP(.E) is of the form / for / involutive. For example, for E = 
and M{f) = (j we have = Id/>i(R), since p = — Hrs. The right 
condition for = Id;^(£;| is then that p — Ald^ with A € K‘\ if /f is 
algebraically closed, one can write A = and then g = f_ — f , with 
(M-'/)^ = IdK. 
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4.5.20. Topology of GP(^). If = R or C and E is finite-dimensional, 
the projective group GP(^) is endowed with a canonical topology, the quo¬ 
tient of the topology of GP(£') by the action of K* Ue (cf. 2.7.1). The 
connectedness of GP(iJ) depends on the orientability of P(E); see 4.9.4 and 
4.9.5, or |FL, 228-230]. 

4.5.21. Remark. The isomorphisms P(E) —* P(E*) lead to a number of 
geometric results, which will be encountered in their natural context in 14.5, 
14.8.12. See also [FL, 260 ff.]. 

4.6. Subspaces 

Observe that if F C F is a vector subspace different from {0}, F is 
saturated under the equivalence relation ^ of 4.1.1. Thus we can identify 
P(F) with p{F) C P{E); in particulsu- the injection i : F —* E gives rise to 
the natural injection t ; P{F) —» P{E), which is a morphism. 

4.6.1. Definition. A subset V of a projective space P(E) is called a 
(projective) subspa.ce if there exists a vector subspace F of E such that p{F) = 
V. fVe consider P(F) — V with its natural projective structure. In particular, 
dimV = dimF — 1 is called the dimension of the subspace V'. 

4.6.1.1. We see that there is a natural bijection between the projective 
subspaces of P(E) and the vector subspaces of E. 

4.6.2. The subspace of dimension —1 is the empty subset. Zero-, one- and 
two-dimensional subspaces are the points, lines and planes of P(E), respec¬ 
tively. Projective subspaces of P(E) coming from vector hyperplanes of E 
are called (projective) hyperplanes of P(E); their set is denoted by M(P(E)), 
and we have (cf. 4.1.3.5) the following bijections: 

, . scf set 

4.6.3 )({P(E)) s )i(E) s P(E‘). 

4.6.4. Let {VijteJ s-** arbitrary family of subspaces of a projective space. 
The intersection (^,g/ V, is still a subspace. We deduce (as in 2.4.2.5) the 
following 

4.6.5. Proposition. Lets be an arbitrary subset of a projective space. The 

subspace (S) spanned by S is the smallest subspace containing S. It is the 
intersection of all subspaces containing S. D 

4.6.6. Definition. The points mi (f = l, ...,fc-fl) are called (projectively) 
independent if 

dim((mi,...,m)t+i)) = k. 

4.6.7. Examples. A single point is always independent. Two points are 
independent if and only if they are distinct, in which case they determine 
a unique line. Three points a,b,c are independent if they are distinct and 
each does not belong to the line determined by the other two: a ^ (6, c). 
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b ^ (c, a), c ^ (a, 6). In this case they'define a unique plane {a,b,c). Looking 
at it the other way around, we can start from the fact that two distinct 
points determine a unique line and the analogous fact for three points amd 
planes, and take these as the first axioms in establishing an axiomatic theory 
of projective spaces. See also |V-Y, volume I, 95]. 

4.6.8. Proposition. The points mi (t = l,...,fc+ 1) are independent if 
and only if mi ^ (mi,..., m,( • •• > mfc+i) for i = 1,..., A; + 1 (cf. 0.1). In 

an n-dimensional projective space, {mi}j=o.i.n+i forms a projective base if 

and only if the points are independent for all i = 0,1,... ,n + 1. 

Proof. The first assertion is just the projective version of a well-known fact 
from linear algebra. To prove the second assertion, take ei & E such that 
p(e,) = m,- (i = 0, l,...,m-|- l); since the points {m,},_i.^-n aJ^e in¬ 
dependent, {e,}i=i.n.,.1 is a basis of E. Put eo = 

that the are independent means that Aj ^ 0 (t = 1,..., n -f 1), so 

.n+i is still a basis of E, satisfying 4.4.1. □ 

4.6.9. Corollary. Let D,D' be projective lines and {a, 6, c} (resp. {a',b', 
c'}) distinct points on D (resp. D'). There exists a unique homography f : 
D —* D' such that f(a) = a', f(b) = b', and f(c) = c'. 

Proof. Prom 4.6.7 and 4.6.8, {a, 6,c} and {a', 6', c'} are bases for D and D', 
respectively. The corollary follows by applying 4.5.10. □ 

4.6.10. Axiomatic theories of projective spaces include axioms about the 
intersection of subspaces; in this case, they consist of the observation in 4.6.1.1 
and the classical linear algebra relation 

dim(P' -I- G) -f dim(F n G) = dim F -f dim G, 

which together give the following 

4.6.11. Proposition. LetV,W be subspaces of the same projective space. 
Then 

dim((P U VP)) -f dim(V n VP) = dim V 4 dim W. □ 

4.6.12. Corollary. 

i) 7/dim P 4 dim VP > dim(P(F)), then P n VP 0; 

ii) if dim(P(F)) = n, then n hyperplanes of P(E) have at least one point 
in common; 

iii) if H is a hyperplane and m a point outside H, every line D through m 
intersects H in exactly one point; 

iv) two distinct lines in a projective plane intersect in exactly one point. □ 

4.6.13. This corollary displays the superiority of projective geometry over 
affine geometry; intersection properties for subspaces can be stated without 
special cases; there are no parallel subspaces (cf. 2.4.9). We shall study in 
chapter 19 a geometry in which there are multiple parallels (19.3.2). 

4.6.14. Equations of subspaces. Calculation of subspaces is carried out 
with the help of 2.4.8.1 and 4.6.1.1. 
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4.6.15. Subspaces and topology. Here if = R or C and all projective 
spaces are finite-dimensional, and have the topology introduced in 4.3.1. Since 
the complement E \ F of & vector subspace E C F is dense in F (U E ^ F), 
we get that the complement P{E) \ P(F) of a subspace P(F) is dense in 
P{E). This property is analogous to 2.7.1.2. Similarly, every subspace is 
closed in P{E). On the other hand, 2.7.3.2 has no analogue here: for any 
hyperplane P{H) in P[E), the space P{E)\P(H) is path-connected. In fact, 
P(E) \ P(H) is homeomorphic to an affine space (cf. 4.3.3.1); alternatively, 
one can construct directly an arc joining two given points of P(E) \ P(H) 
(figure 4.6.15). 




4.6.16. Finite fields. The points, lines and other subspaces of a projective 
space P{E) over a finite field with k elements form interesting configurations. 
For example, each line contains the same number of points, and each point 
lies in the same number of lines. Figure 4.6.16 represents the case k = 2, 
dim/’(£') = 2: the space has seven points and seven lines (one of which is 
represented by a circle). For more details on projective configurations, see 
|H-C, 94-143), an easy reference with many drawings, and [DI], a more recent 
reference. See also 4.9.11. 


4.7. Perspective; aerial photography 

4.7.1. Let H,H' be hyperplaiies of a projective space P(E) and m a point 
not in H or H'. Given an arbitrary i e JT, there exists by 4.6.7 and 4.6.12 
a unique line (m, x), which in turn intersects H' in a single point ^(x) = 
H' n ((m, x)). This defines a map g •. H —* H'. 

4.7.2. Proposition. The map g belongs to lsom(H‘, H'). 
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Proof. Let V and V' be the hyperplanes of E such that H = p(V) and 
H' = we are looking for / 6 L(K; V') such that J_= 9- It is enough to 

take the restriction to V of the linear projection onto V' in the direction of 
the one-dimensional vector subspace p~*(m) (i.e., the projection associated 
with the direct sum E — p”*^(m) ® V). □ 




Figure 4.7.1 Figure 4.7.2 

4.7.3. The map g is called the perspective of center m taking H onto H'. 
This is consistent with the everyday usage of the word perspective: the cor¬ 
respondence between two planes established by our vision or a photograph. 
H contains the plane object to be photographed, and H' contains the film. 

In aerial photography, the problem arises of matching pwtial shots of 
an area which is too large to fit in a single photo. Assuming the area to 
be flat, it is possible to use perspectives to make overlapping parts coincide 
perfectly; the correspondence between matching points in two photographs 
is a honiography, being the composition of the perspectives of the two photos 
(figure 4.7.3.1), and thus can be composed with another homography to give 
the identity. By proposition 4.5.10, it is enough to match four points in the 
two images to obtain a perfect correspondence. See [BUR, 36-51). 



Figure 4.7.3 
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Figure 4.7.4.1 


Figure 4.7.4 2 
(Source: IGN, France) 
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4.7.4. As a practical exercise, the reader can locate four points on figures 
4.7.4.1 and 4.7.4.2, transfer them to a blank sheet of paper, and verify that 
the two sets of points cannot be mapped into one another by an afline map. 

4.7.5. Aerial photography and cartography. It is only fair to 
warn the reader that, when a map is assembled by patching together aerial 
photographs, the matching of overlapping portions of the patches can never 
be perfect as indicated in 4.7.3. This is because the surface of the earth, even 
not considering local elevations, is not plane but spherical (in fact, ellipsoidal, 
cf. 18.1.5.3). One has to work in three dimensions, with the help of intricate 
and expensive devices called stereocomparators (see [BUR, 230 and plates 
21-23]); only thus can one hope for a theoretically perfect matching. On the 
subject of cartography, see 18.1.5 and 18.1.8. 

4.7.6. Perspective still makes sense in afline spaces, but then it is only defined 
on a subset of the hyperplane being observed. The perspective g with center 
m from H onto H' is defined on H\D, where D is the intersection of H and 
the hyperplane parallel to H' and going through m. This affine perspective 
is often used as an introduction to projective geometry, in particular to the 
projective completion of an affine space discussed in the next chapter. 



4.8. The non-commutative case 

4.8.1. One can define projective spaces, morphisms and subspaces for skew 
fields. We have mentioned en passant (4.5,6, 4.5.10.1) certain results which 
do not hold in this case; it is a good exercise for the reader to sift through the 
chapter to find what results really require commutativity, giving counterex¬ 
amples when appropriate. 
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M. C. Escher, Tower oj Babei, 1928. Woodcut, 62 x 38.5 cm. 
Escher Foundation-Haags Gemeentemuseum (The Hague) 

See (ERj 
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4.8.2. For instance, one can define projective spaces of arbitrary dimension 
n over the skew field H of quaternions (8.9): 

P"(H) = 

Such spaces are Hausdorff and compact with the natural topology (cf. 4.3). 
The projective line P*(H) is homeomorphic to the sphere S* (cf. 4.9.7). 
The algebraic topology of P"(H) is also completely known; see, for exam¬ 
ple |SR, 265], or the interesting article [E-Kj. 

4.8.3. It is even possible to define projective spaces over structures more 
general than fields. An important example is the division ring of octonions 
or Cayfey numbers Ca, which is an eight-dimensional real vector space with 
a non-associative multiplicative law. One can construct the projective line 
P*(Ca), homeomorphic to the sphere S®, and the octonion projective plane 
P^(Ca); see jPO, chapter XIV], |B02, 199], ]E-K, 12], and especially ]BES, 
chapter 3]. 

The reason why one cannot define projective spaces of dimension greater 
than two over Ca is that such that a projective space (with the standard 
axioms of intersection of subspaces) must necessarily satisfy Desargues’s the¬ 
orem (see 5.4.3 and 5.4.4), which, in turn, implies that the base structure 
is associative (JPT, chapter 3] or JHA, 374); the last chapter of this latter 
reference, on projective planes, is very interesting.) 

4.9. Exercises 

4.9.1. Let P(E) be a finite-dimensional real or complex projective space. 
Show that, for every hyperplane H, the bijection P{E) \ P[H) — » Eh defined 
in 4.3.1 is a homeomorphism. 

4.9.2. Prove the last paragraph of 4.3.9.1. 

4.9.3. Prove that P”(R) is homeomorphic to the quotient topological space 
of the closed ball B(0,1 ) by the equivalence relation ~ defined by z ~ y if 
and only if z = y or z 6 5(0,1) = 5”“^ and y = — z. In particular, study 
the cases n = 1 and n = 2 (cf. 4.3.6 and 4.3.9.1). 

* 4.9.4. For a real projective space of finite dimension n, find the sign of the 
Jacobian of the transition maps 4.2.4.3. Deduce that the manifold P"(R) is 
orientable if n is even and non-orientable if n is odd. 

* 4.9.5. Prove the result in 4.9.4 by studying the connectedness of the projec¬ 
tive group GP(P"(R)). 

4.9.6. Study the images and inverse images of subspaces under morphisms. 

4.9.7. Show that P^(H) is homeomorphic to the sphere S*, where H is the 
skew field of quaternions (cf. 4.8.2). 



no 


Chapter 4. Projective spaces 


4.9.8. Determine exactly where the assumption that the base field is com¬ 
mutative intervenes in the proof of 4.5.10. Give a counterexample when the 
field is non-commutative. A good place to try is P"(H) (cf. 4.8.2): you can 
even try to find an infinite set of points with the property that every subset 
with n -f 1 points is linearly independent, but which is nonetheless invariant 
under a homography different from the identity. 

* 4.9.9. Find a model, in the usual three-dimensional space, for the configura¬ 
tion formed by the points, lines and planes of Draw pictures. 

* 4.9.10. Let Hi be a family of hyperplanes in the projective space P{E) of 
finite dimension n; find a relation between dim f); Hi in P[E) and dim (JJ, Hi) 
in P(E*). 

* 4.9.11. Let if be a field with k elements, and P(E) a projective space of 
dimension n over K. Show that the cardinality of the set of p-dimensional 
subspaces of P(E) is equal to 

(/b~+i - - Jfc) • ■ - ifc'’) 

(/b»*+‘ - l)(/fcP+i - jfc) (jfc»>+i - jfci*) ■ 

Show that the order of the projective group GP(.E) is 

(/b"+^ - l)(/b'‘+‘ - fc) • {jfc”+* - jfc"-^)ik” 

* 4.9.12. Mobius TETRAHEDRA. Construct, in a three-dimensional projec¬ 
tive space, two tetrahedra {a, 6,c,d} and {a',b',c',d'} such that each vertex 
of the first belongs to a face of the second and vice versa (i.e., o 6 {b',c',d'), 
a’ € {b,c,d), and so on). For an explanation and a generalization of this phe¬ 
nomenon, see 14.5.5 and 14.8.12. For a study of Mobius tetrahedra formed 
by mechanical linkages whose vertices describe curves of constant torsion, see 
|BA|. 


b* 



Figure 4.9.12 






Chapter 5 

AfRne-projective relationship: 
applications 


In this chapter we add to an affine space X the set P{X) of 
directions of its lines, obtaining a projective space in which X 
is naturally embedded. Conversely, the complement of a hyper¬ 
plane in a projective space has a natural affine structure. This 
process of projective completion can be performed in more or 
less natural ways (section 5.0). In section 5.3 we translate par¬ 
allelism behavior in X in terms of intersection properties in the 
completion of X. This has numerous applications, some of which 
are given in section 5.4. 
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All fields are assumed commutative. 


5.0. Introduction 

5.0.1. We have mentioned in 4.0 the shortcomings of affine geometry. In 
particular, following Desargues, we want to extend an affine space X into 
a projective space X, the union of X and its points at infinity. These are 
defined as the directions of lines of X, and their set P(.^) is also written oox. 
Thus X = Xu oox = X U P(X), this being a priori just a disjoint union in 
the set-theoretical sense. The non-trivial part is making X into a projective 
space. This can be done axiomatic ally, but this is not the point of view we 
have been adopting in this book (cf. 4.1.2 and |AN), |HA), (DI), [H-P], [PT]); 
instead we present three algebraic constructions for the projective completion. 

5.0.2. In the coarsest approach, let X have finite dimension n, and let 
.n iin affine frame of X. The desired completion is 

and the embedding X —* P"(K) is given by 
5.0.2.1 x= (Ai,...,A„)h^p{A„...,A„,1 ) 

(cf. 2.2.9 and 4.2). We see that the complement of the image of X in P'^[K) 
is exactly the hyperplane = P{K’^ = K" x {0}), which is indeed 

identified with P{1t) by 5.0.2.1. 

5.0.3. A more sophisticated construction, suggested by 5.0.2, consists in 
vectorializing X aX a & X, and considering the projective space P(A'o x K), 
where X K is the direct product of the two vector spaces Xa and K. The 
embedding of X into P(Xa x K] is furnished by composing the identification 
between X and the section Xa x {1} of X„ x K with the projection onto the 
quotient: 

5.0.3.1 1). 

The complement of the image of X in P(Xa x K) is P(A^a) — P(.^)- 
p(x,i) K 



Figure 5.0.3 
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5.0.4. The most sophisticated approach utilizes the universal vector space X 
associated with X: just take X = P(.y), and embed X in P{^) by means of 

5.0.4.1 

where X is understood as an affine hyperplane of X (cf. 3.1.6). 

5.0.5. The disadvantage of the first two constructions is that they are not 
intrinsic. In 5.0.2, for example, we do not know a priori that the projective 
space we obtain is canonically associated with X (that is, “depends only on 
X” in some sense). The same holds for 5.0.3, though in this case we can 
already perceive that P{Xa) is always identical with P[X), since the choice 
of o only changes things by a translation in X, and translations preserve di¬ 
rections (cf. 2.3.3.4). On the other hand, 5.0.2 and 5.0.3 have two advantages: 
first, they offer an explicit base in which to calculate; and second, being more 
elementary, they allow a simpler treatment of the complexification questions 
which will arise in chapter 7. 

5.1. The projective completion of an affine space 

5.1.1. Let X be an affine space and X the associated universal vector space 

X = XU(K* X X) 

(cf. 3.1). Define X X = P(.Y). Recall that X is embedded in X via x i—» (x, 1), 
and remark that the canonical projection p : X X = P{X) is injective when 
restricted to X; thus X can be identified with a subset of X. This gives rise 
to a partition of X into two sets, X = X U P[X). We put OCX = P(X), so 
we can also write X = X U cx3x- The conclusion is that the affine space X 
is embedded in the projective space X, the complement of the image of the 
embedding being a hyperplane oox = PlX). 

5.1.2. Conversely, if P(E) is a projective space and P{H) C P[E) is an 
arbitrary hyperplane, the complement P{E) \ P[H) is an affine space in a 
natural way: P(E)\P(H) = Eh (cf. 2.2.6). By 2.2.7 and 3.1.6, the processes 
of passing from X to X and from P(E) \ P(H) to Eh are inverse to each 
other. 

We can summarize the aforementioned results from chapters 2 and 3 in 
the following 

5.1.3. Theorem. 

i) Let X be an affine space and X = P(X). Identify X with a subset of X, 
also denoted by X , via x i—♦ p(i, 1). The complement X \ X is equal to 
P{X) = oox. The space X is called the (projective) completion of X, 
and oox = P{X) is called the hyperplane at inhnity of X. If X,X' are 
affine spaces and X,X' are their completions, for each affine morphism 
f 6 j4(X;X') there exists a unique projective morphism f € M(X;X') 
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such that j\x = /• Moreover, /(oox) C oox' o,nd f\oox — / (cf. 2.3.1 
and 4.5.2). 

ii) Let {E,H) be a pair formed by a vector space E and a hyperplane H C 
E. Then Ejj = P(E) \ P{H) (cf. 4.1.3.4) has a natural affine struc¬ 
ture over L(E/H]H'). Let [E',H) be another such pair, and take g € 
M{P(E)-, P(E')) such that the domain of g contains Eh (cf. 4.5), and 
that g{EH) C Ejj,. Then the restriction gls^ 

iii) The correspondences X >-+ [X, X) and (E, H) Eh are functorial and 
inverse to one another. 

Proof. This essentially follows from 2.2.6, 2.3.7, 3.1.6 and 4.5. For (ii), we 
must show that g|c„ € A^Eh', E^h’)- Take / e L(E; E') such that / = g; we 
must show that, in fact, / € Lh,H'(E‘,E') (cf. 2.3.6). 

By assumption, g is defined on the whole of Eh , which means (cf. 4.5.2) 
that /~*(0) C H; and g{EH) C E'jj, implies f{x) ^ H' for any x ^ H. This 
shows that f(H) C JP and / : E/H —> E'/H' is injective. 

Functoriality follows from 3.2.1. In order to prove rigorously that the 
correspondences in (iii) are inverse to each other, we would need to use cat¬ 
egory theory; in that nomenclature, theorem 5.1.3 says that {E,H) —+ E/H 
is a “completely faithful” functor. □ 

For practical applications, (i) and (ii) suffice (in fact, just 5.0.2.1 and 4.2.4.1); 
(iii) was thrown in for the sake of completeness. 

5.2. Examples 

5.2.1. Characterization of dilatations. According to 2.3.3.12 and 
4.5.9, one can characterize the dilatations of an aflfine space as the aflfine maps 
/ such that / leaves the hyperplane at infinity pointwise invariant: 

/ e Dil(X) <=> f\oox = Woox • 

5.2.2. Affine and projective groups. The maps g e GP(.X') which 
leave oox globally invariant are exactly those of the form g = f, where / is an 
arbitrary element of GA(X). In other words, we can identify GA(X’) with a 
subgroup of GP(X), namely, GPooxl-^)- Gayley discovered that all classical 
geometric groups can be realized as subgroups of the projective group of a 
suitably chosen projective space: see 9.5.5.2 and 18.10.1.5. 

5.2.3. The case X = K. In this case P[X) = P(K) = P°(K) consists of 
a single point, the point at infinity of K, and we have K = K Uoo. Observe 
that for K = R the completion R = R U oo is distinct from the extended 
real line used in analysis, Ru {—oo, -foo}. In projective geometry there is no 
distinction between “approaching -foo” and “approaching —oo”. 

Returning now to K arbitrary, observe that can be naturally iden¬ 
tified with K^, and thus K with F^(ff’); under this identification, x € if 
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corresponds to p(x, l) and oo to p(l,0). Putting K = K U oo, table 4.5.15 
becomes 


5.2.4 c ^ 0 < 


at + b , d 

t —— j i t ¥= — 

ct+ d 
d 

— t-+ oo 
c 

a 

oo - 
c 




^ at + b 

OO OO 


where the morphism / under consideration has matrix M(f) = (^^). 


5.2.5. For K = R or C and X finite-dimensional, 4.3.2 shows that the 
embedding of X into .Y is a homeomorphism onto its image; 4.6.1 shows that 
X is open and everywhere dense in X. This, together with 5.2.3, explains the 
remark at the end of 4.5.14. 

By the way, all morphisms in A{X-,X') and M{^P{E)\P[E')) are con¬ 
tinuous with respect to the canonical topologies. In arbitrary dimension, / is 
the continuous extension of / for / 6 A[X;X'). 

5.2.6. For if = R or C, 4.3.2 and 4.3.3 show that K is exactly the Alexan- 
droff compactification of K. 





Figure 5.2.6 


5.3. Relationship between affine and projective 
subspaces. Parallelism 

We will now show that our construction assigns to parallel affine lines 
the same point at infinity, as promised. 
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5.3.1. Let S C X be an afBne subspace of an affine space X; then S is 
embedded in X. By 2 .4. 2 .3, the projective subspace (5) spanned hy S C X 
can be identified with the projective completion S of S; thus we have {S) = 
S = SUoos, where 005 is a subset of X, namely, 005 = oox H (S) = oox n5. 
Note that 005 = P(^) C P(X) = oox- FVom definition 2.4.9. 1 , we get the 
following result: 

5.3.2. Proposition. Tke correspondence S y-* S is a bijection from the set 
of affine subspaces of X into the set of projective subspaces of X not entirely 
contained in oox • We have S = 005 U 5, where 005 = oox H S. Moreover, 
if S, S' are affine subspaces of X, we have 

005 = 0O5< <==>■ S II s', 

OOg C OOgi <==>■ S < S'. 

5.3.3. Remark. For if = R or C and X finite-dimensional, 5.2.5 shows 
that S is just the topological closure of S in X. 



5.4. Sending objects to infinity; applications 

5.4.1. Proposition (Pappus’ theorem—projective version). Let 

P{E) be a projective plane, D and D' two distinct lines in PIE), and a, b, c, o', 
b', c' six distinct points such that a,b,c € D\[DnD'), a',b',c' e D'\[DnD'). 
Then the three points {a,b')n{a',b), {b, c')n{h',c), {c,b')n{c',b} are collinear. 

Proof. Remark first that these points exist by 4.6.12. Consider the points 
7 = (o, 6 ') n {a',b), a = {b,c') n ( 6 ',c), and the projective line V = (a, 7 ) 
joining them. Introduce the affine plane X — P[E)\V (cf. 5.1.3. (ii)); observe 
that a,b,c,a',b',c' € X. In X we have ( 0 , 6 ') || (o',fc) and {b, c') || {c',b), by 
5.3.2 and the construction of V using the affine version of Pappus (2.5.3), we 
get (a, c') II {a',c), and another application of 5.3.2 gives 

/0= (a,c')n(a',c) e (a, 7 ). □ 

We get the next result by applying 5.4.1 to the completion X of X: 
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5.4.2. Corollary (Pappus’ theorem—second affine version). 

Let X be an affine plane, D and D' two distinct lines of X, a,b,c,a',b',c' six 
distinct points such that a,b,c € B\(Dn D'), a', b', c' e D' \ (D D D'). Then 
the three points {a,b') D (a',b), (b,c') fi {b',c), (c, 6 ') O (c', 6 ) are collinear in 
the following sense: if all three exist, they are collinear; if only two of them 
exist, the line joining them is parallel to the two lines defining the third (if 
none of them exists we are in the case discussed in S.5.S). □ 

Here we can see how much simpler statements of projective results are than 
the statements of corresponding results of affine geometry, which must ac¬ 
count for lots of particular cases. See also 16.8.19. 

5.4.3. Proposition (Desargues’s theorem—projective version) 

Let P(E) be a projective space and s,a,b,c,a',b',c' seven distinct points of 
P(E) such that s,a,b,c and s,a',b',c' are projectively independent and that 
o' 6 {s,a), b' € {s,b), c' € (s,c). Then the three points {a,b) n {a',b'), 
{b,c)n{b',c'), (c, a) n (c'j o') are collinear. In other words, if the lines joining 
the corresponding vertices of two triangles are concurrent, the intersection 
points of the corresponding sides are collinear. 

5.4.4. The proof is radically different depending on whether the dimension 
of P{E) is equal to or greater than 2. If dim(F(jE')) > 3, we just use the 
intersection properties for subspaces, and the result is valid in any axiomatic 
theory of projective spaces (cf. 2.6.7); if dim(P(£')) = 2, one must resort 
to theorem 2.5.4, which requires associativity of the base field (cf. 2.5.5 and 
4.8.3). 

5.4.5. Suppose first that the projective space Z spanned by our seven points 
has dimension three; the conclusion follows trivially from 4.6.12, since the 
three points being considered belong to the two planes (a, 6 , c) and (a', 6 ',c'), 
whose intersection is a straight line (else Z would be two-dimensional). 

Now suppose dim(P(£')) > 3 but dimZ = 2; we choose an arbitrary 
point m ^ Z and two points a, a' on the lines (m, a), (m, a') such that 
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S 



5 € {a, a') and a ^ a, a' ^ a' (such points can always be found because a 
projective line contains at least three points, cf. 4.1.3.7). 

Then the seven points s,a,a',b,b',c,c' span a three-dimensional sub¬ 
space by construction, and thus satisfy the statement of the theorem as shown 
in the first step. But the projection from P(E) \ {m} onto Z with center m 
takes a, a' into a, a' and preserves intersections and collinearities, so the result 
follows for S, a, a',b, b', c, c'. 



5.4.6. If dim(F(£')) = 2, we use a proof similar to that of 5.4.1, sending to 

infinity the points (a, 6) n(a',6') and (b,c) fl {b',c'). The conclusion follows 
from the converse of 2.5.4 (cf. 2.8.6). □ 

5.4.7. Corollary (Desargues’s theorem—second affine ver¬ 
sion). Let X be an affine space and s,a,b,c,a',b',c' points satisfying the 
conditions in 5.4-3. Then the three associated intersection points are colli- 
near (allowing for special cases when some of then do not exist, as in 5.4.S). 

□ 

5.4.8. Theorem (second fundamental theorem of projective 
geometry). Let P{E) and P{E') be projective spaces of same finite di¬ 
mension > 2, over fields K and K', respectively. Let f : P(E) —* P{E') be 
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a bijection taking triples of collinear points in P(E) into collinear points in 
P(E'). Then there exists a semilinear bijection f i E —* E' such that f = f. 

Proof. As in 2.6.6, one shows that / transforms hyperplanes of P{EJ) bi- 
jectively into hyperplanes of P{E'). One then fixes a hyperplane P(H) C 
P{E), whose image is the hyperplane P[H') C P(E'). The restriction 
9 — f\piE)\Pitn ■ Eh —* Efji is a map from the affine space Eh into the 
affine space E'jj,, satisfying the conditions of 2.6.3. Thus g is semiaffine; by 
an easy extension of 5.1.3 to the semiaffine case, g can be extended to a semi¬ 
morphism g : P(E) — » P(E'), which coincides with / on Eh by construction, 
and on P{H) because f\p(H) — 9 5-1.3). □ 

5.4.9. For various refinements of 5.4.8, see [FL, 83 ff. and 267 ff.]; see also 
[BI5, 415-416, exercises 16 and 17]. For a generalization of 5.4.8, see [TS, 
p. Vlllj. 

5.4.10. We shall have ample opportunity to employ this technique of send¬ 
ing points to infinity: see 5.5.3, 6.4.4, 6.4.8, 6.4.10, and a good portion of 
chapter 17. 

5.5. Exercises 

5.5.1. State and prove the converses of 5.4.1 and 5.4.3. 

* 5.5.2. Draw the figures for the theorems of Pappus and Desargues (5.4.2 and 
5.4.7) when there are points at infinity. 

* 5.5.3. Give a simple proof for the existence of Mobius tetrahedra (cf. 4.9.12 
and [B-P-B-S, problem 4.6|) by sending lots of points to infinity. 

* 5.5.4. Suppose you are given a piece of paper with one point marked and seg¬ 
ments of two lines which intersect outside the paper. Using only a straightedge 
(ruler), draw the line that joins the given point with the intersection point of 
the two lines. 



Figure 5.5.4 Figure 5.5.5 


* 5.5.5. Now suppose you are given two points, but your ruler is too short to 
connect them. Draw the line joining the points. 







120 


Chapter S. Affine-projective relationship: applications 


5.5.6. Study and comment on the proof of the fundamental theorem of 
projective geometry given in [DX, 28 ff.]. 

5.5.7. Give necessary and sufficient conditions for the line (a,)9, 7 ) in 5.4.1 
to go through D n D'. 

* 5.5.8. Hexagonal webs. We shall define a web in a real affine plane P as 
the following set of data: an open set j4 in P, and for each point a in j 4 three 
distinct lines di(a) (t = 1,2,3) in P which go through a and which depend 
continuously on a. Show that for b on di (a) close enough to a, we can define 
six points ( 6 <)i,...,e follows: 

bi — d 3 (b) n d2(a), b2 = di(bi) n d3(a), 63 = <f2(^>2) ^ ^i(“), 

^>4 — <^ 3 ( 63 ) n d 2 (fl), ^5 <^ 1 (^ 4 ) n d 3 (a), fee = ^^ 2 (^ 5 ) G di (a). 

A web is said to be hexagonal if be = 6 for all sufficiently close o and b. 

Let (p,)i=i, 2,3 be three points of P, not on the same line, and let A be 
the complement of the three lines which connect each pair of points p,. We 
define a web by setting di(a) = (a,pj) for every a & A. Show that this web b 
hexagonal. 

+ 5.5.9. More hexagonal webs. Show that the web in figure 5.5.9.1 
is hexagonal. More generally, consider a conic section C and a point p not 
situated on the conic. Assign to any point in the complement of CUp the two 
tangents to C through x and the line xp. Show that the web thus obtained 
is hexagonal. 

For more on webs, see [BL-BO] and [CH-GR|. For figure 5.5.9.2, see 
[B-P-B-S, remark on p. 166). 
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Figure 5.5.9.2 



Chapter 6 

Projective lines, cross-ratios, 
homographies 


This chapter introduces a homography invariant of quadru¬ 
ples of points on projective lines, called the cross-ratio. This 
invariant plays a fundamental role in projective geometry; one 
particular case is the notion of harmonic division. 

Cross-ratios will be a recurring theme in this work, com¬ 
ing in sometimes unexpectedly. Examples are the Laguerre for¬ 
mula (8.8.7), hyperbolic geometry (chapter 19), the Ricatti equa¬ 
tion (6.8.12), and differential geometry (6.8.20). The ubiquity of 
cross-ratios illustrates the importance of Cayley’s result (5.2.2). 
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All fields are assumed commutative. 


6.1. Definition of cross-ratios 

6.1.1. Corollary 4.6.9 can be rephrased by saying that triples of distinct 
points on a projective line are indistiguishable from the point of view of the 
projective group, or again that the projective group GP(£7) is 3-transitive, i.e., 
transitive on the set of triples of distinct collinear points of P(E) (cf. 1.4.5). 

You may have suspected by now that this is no longer true of four points; 
indeed, we shall use a scalar (in fact, an element of K) to classify the orbits 
under GP(E) of quadruples of collinear points. 

6.1.2. Definition. Let D be a projective line over K and {o, b, c,d} a 
quadruple of points of D such that a,b,c are distinct. The cross-ratio of the 
four points, denoted by [a,b,c,d], is the element of K — K U oo given by 
|a,li,c,dj = fa.b.c(d), where fa..b.c denotes the unique element of hom(D; K) 
such that f[a) = oo, f(b) = 0 and /(c) = 1 (cf. 4.6.9). If 

{a, fc,c,d} = {ai}i=i. 2 . 3 . 4 > 

we also write (<ii, ^21 ® 3 i <* 4 ] = [®«)- tire collinear points in a pro¬ 

jective space, the first three of which are distinct, their cross-ratio is, by defi¬ 
nition, their cross-ratio as points of the line that contains them. If {a,b,c,d} 
are collinear points in an affine space X over K, the first three of which are 
distinct, their cross-ratio is, by definition, their cross-ratio as points of the 
projective completion X of X. 




The following results are a consequence of 4.6.9; 

6.1.3. Proposition //[o,b, c,dj = 00 (resp. 0, \), then d = a (resp. d=b, 

d — c), and conversely. If \a,b,c,d] € if \ {0,1}, then d ^ a,b,c, and 
conversely (since if \ { 0 , 1 } = K \ { 00 , 0 , l}). □ 

6.1.4. Corollary. Let D,D' be projective lines and {at}«=i. 2 . 3.4 (resp. 

{aj }i=i. 2 , 3 . 4 .) points in D (resp. D'), the first three elements of each set being 
distinct. Then there exists f € Isom(i?;/?') such that f{ai) — a[ for all i if 
and only if [oi] = [aj]. □ 
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6.1.5. Corollary. LetD be a projective line anda,b,c three distinct points 
of D. For any k & K there exists a unique d G D such that [a, b, c, d] = A. □ 

6 .1.6. Proposition. Let D = P{E) be a projective line, a,b,c distinct 
points of D and x,y & E such that a = p(x), b = p(y) and c — p(x + y) 
(cf. 4.4.1). Then we have (cf. 5.2.3) 

d = p(kx + hy) <=> [a, 6 , c, dj = p(k, h) £ P{K^) = K. 

Proof. According to 5.2.3, we have, writing {ei,e 2 } for the canonical basis 
of K'^: 

P(ei) = P(I.O) = oo, p(« 2 ) = p(0,1) = 0, p(e,-f ea) ==p(l,l) = 1. 
Thus fa.b.c = /) where / £ L{E-, K^) is defined by 

f(x) = et, f(y)=e 2 , /(x + y) =/{ei-t- 62 ). 

But then 

/(d) = fjj){kx -I- hy)) = p{f(kx -t- hy)) = p(k, h). □ 

6.1.7. Remarks. Proposition 6.1.6 is often used as an alternate definition 
for cross-ratios. It is, in fact, the only possible definition when the base field 
is not commutative (cf. 6.8.13).skew fields 

One can also impose formula 6.2.3 as a definition, proving that it is 
invariant under the linear group GL(2;ir), which is easy. This course b not 
entbely unjustified if one starts by systematically looking for invariants of 
homography; this point of view b adopted in |DI-CA, 23]. See also [SFl], 
[SF2] and 18.10.7. 

6.2. Computation of cross-ratios 

6.2.1. The question b to compute |aj], where the Oi are given by their 
homogeneous coordinates (cf. 4.2.3) in an arbitrary basb B of E (where D = 
P(E)). Let Oj = p(xi) = p(Ai,/r,), for i = 1,2, 3, 4. Write Z 3 = axi -f 0X2 
and Z4 = 7Z1 -f 6x2', t'hen 

'y 6 

I 4 = -(azi) -I- ^(/?X 2 ), 
a p 

so that [aj] = p['^/a,6/P) (cf. 6 . 1 . 6 ). Using Cramer’s rule to compute 
a,0, 7,6 , we have 

^ ^ det(z 3 ,Z 2 ) ^ ^ det(zi,Z 3 ) ^ ^ det( 14 , 12 ) ^ _ det(zi,Z 4 ) 

det(xi,X 2 )’ det(xi,X 2 )’ det(xi,X 2 )’ det(xi,X 2 )’ 

where the determinants are with respect to the basis B. Hence 


f 

A4 

X2 


Ai 

A4 



M4 

M2 


Ml 

M4 


V 

^3 

M3 

A2 
M2 ' 

1 

Ai 

Ml 

A3 

M3 

y 


6.2.2 
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This expression can be written more compactly as follows (under the conven¬ 
tion, compatible with 5.2.4, that division by zero gives oo 6 K): 


6.2.3 


“i = 


A 3 Ai 


A 4 X 2 

Ml 


M4 M2 

A 3 A 2 


A 4 Ai 

M3 M2 


M4 Ml 


Let D be an affine line and {a,} four points, the first three of which are dis¬ 
tinct. Take an arbitrary system of affine coordinates, and let the coordinates 
of Xi be a,- (i = 1,2,3, 4). Then we can write a,- = p(xj, 1) in D\ thus 6.2.3 
shows (cf. 2.4.6) that 


an A t 1 (^3 - 2:i)(l4 - X2) X3X1 X4X| 

1“>I = ?-w-i = —r —r - 

(X3 - X2)(X4 - Xi) X3X2 X4X2 

In particular, if a, 5, c are three points on a line of an affine space X, we 
have in X (cf. 5.3.1) 


6.2.5 [a, 6 ,c, ooc] = —. 

cb 

Formula 6.2.4 furnishes an elementary definition of cross-ratios within the 
affine framework. 

6 . 2 . 6 . The moral of the story is that formula 6.2.5 is a geometric 
confirmation of the fact that the complement P(E) \ P(H) = Ej] = X of 
a hyperplane in a projective space has a natural affine structure (cf. 5.1.3 
and 2.2.6). For if this is so, one must be able to define geometrically no¬ 
tions such as the sum 6 + c in a vectorialization X„ of X, and the quotient 

co /cb for collinear points. Formula 6.2.5 answers the second problem and the 
parallelogram rule (cf. 2. 6 .6.3), interpreted in P{E), answers the first (figure 
6.2.6). 




6.2.7. Geometric computation of cross-ratios. See 6.5.6, 6.5.10, 
and, for the case K = C, 9.6.5. 
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6.3. Action of permutations 

Let {a,} be four distinct points of a projective line D (which forces 
K ^ Z 2 !). Their cross-ratio depends on the order they are considered; the 
dependence is expressed in the following 

6.3.1. Proposition. 

[a,fc,c,d) = (fc,a,c,d]~* = (a, 6 ,d,c)“\ 

[a, 6 ,c,d) -f (a,c, 6 , rf] = 1 . 

Proof. The first two equalities come immediately from 6.2.3. For the last 
one, put a — p(x), b = p(y), c = p{x + y), d = p(kx + y); then |a, b, c, d] = k. 
Replace the basis {x, y} by the basis {—x, x-fy}. Then write y = ~x+ (x+y) 
and fcx -f y = (l — fc)(—x) -t- (x + y), whence |a, c, fc, d] = 1 — k (cf. 6 . 1 . 6 ). □ 

Since the three formulas of 6.3.1 correspond to three transpositions of {a, 6 , c, 
d} which generate the group of permutations of this set, it is easy to figure 
out the effect of any permutation. To make this more precise, we complicate 
it a bit: 


6.3.2. Proposition. Set K* = Ff \ {0,1} and let St be the symmetric 
group of {1,2, 3,4}. Let D be a projective line, (a,) four distinct points of D, 
k = [oi] their cross-ratio. Then the cross-ratio a(fc) = lOff(t)) € K* depends 
only on k and a € S 4 , and not on D or the o,-. The map 4> ■ St B ^ {k ^ 
cr-^[k)) e Sk- is a Si-action on K*. This action is not faithful; its kernel 
Ker<j> is the Klein group V 4 of symmetries of a rectangle (see 0.2), unless K 
is the field F 4 with four elements, in which case Ker^ = At, the alternating 
group. All orbits of K have six elements, with the following exceptions: 

i) tf K has characteristic three, the orbit of k — —1 has a single element; 

ii) If K has characteristic different from two or three, the orbit of—I has 
three elements —1,2, 1/2, and if K contains the cube roots j,j^ of 1 , the 
orbit of —j has two elements —j, —j^; 

iii) If K = Fi, there is a single orbit with two elements. 

Proof. From 6.3.1, we know that Ker^ D 1 ^ 4 . We investigate the orbit of an 
arbitrary k e K*; applying 6.3.1, the only possibilities are 

,1 . 11 k 

fr — 1 _ ic 1 ~' — - - 

’ fc’ ’ fc’ 1 - fc’ fc - 1’ 

All we have to do is check whether these are all different elements of K*] this 
can be reduced to checking the relations fc^ — 1 = 0 , fc^ — fc-t -1 = 0 , 2 fc — 1 = 0 , 
fc — 2 = 0. Bearing in mind that the only root of — 1 = 0 different from 1 
is — 1 , that the only roots of — 1 different from 1 are j,p, and that F 4 is 
obtaining by adjoining to Z 2 , we immediately obtain 6.3.2. □ 

6.3.3. Notes. In certain texts, the cross-ratio is denoted by [“|j]. This 
rectangular notation has the mnemonic advantage of indicating the action of 
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Ker^ = V 4 , the group of symmetries of the rectangle, on the matrix itself; 


a b 

—► 

b a 

c d 


d c 


c d 


d c 

a b 


& a 


For the non-commutative case, see 6.8.13. 

Figure 6.3.3 shows the graph of the function naturally associated with 
the cross-ratio function in the case A’ = R, 


A 


4 (A^ - A -K 1)® 
27 A2(l-A)2 ■ 


This function takes the same value on the points of an orbit. |DX, 43-51) 
and 6 . 8.11 give some nice facts concerning this function and the cross-ratio. 



6.4, Harmonic division 


We assume the base field has characteristic ^ 2. 

The definition of cross-ratios allows two of the four points to be the same. 
Trying to extend the definition to the case where more than two points are the 
same leads to hopeless difficulties, except in the case of the orbit { — 1 , 1 / 2 , 2 } 
mentioned in 6.3.2. Whence the following 

6.4.1. Definition. Four points a,b,c,d of a projective space or an affine 
space are said to be in harmonic division if one of the following conditions 
holds: either three of the points are the same and the fourth is different, or 
they are all different, collinear and [a, 6 , c, d] = — 1 . 

Observe that the order matters here; If a,b,c,d are in harmonic division, 
b, a, c, d are not. 

6.4.2. If a,b,c are contained in a line D of an affine space, 6.2.5 shows that 
a, b, c, ood are in harmonic division if and only if c is the midpoint of {a, 6 }. 
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6.4.3. In an arbitrary set of affine coordinates on an affine line, the points 
with coordinates a, b, c, d will be in harmonic division if an only if 2 (a 6 -|- cd) = 
(a-t- fc)(c -I- d); this is a consequence of 6.2.4. When o = 0 (resp. b = —a), this 
reduces to 2/b — 1/c + 1 /d (resp. = 6 ^ = cd). 

6.4.4. The complete quadrilateral. Let {a, fc,c, d} be a base for a 
projective plane P, and put a = (a, 6) n (c, d), 0 = {a, d) n ( 6 , c), 7 = (a, c) fl 
( 6 ,d), S = (o, c) r\{a, 0 ). Then a,c,'),6 are in harmonic division. 

Proof. Send the line D = (a,/?) to infinity (figure 6.4.4). In the new affine 
plane P \ D, we have ab || cd and ad || be by construction, so o, b,c,d is 
a parallelogram, and 7 is the midpoint of a, c (see figure 2.4.5). Now 6.4.2 
shows that [a,c, 7,5 = (a,c)oo] = — 1 - D 




6.4.5. The preceding result gives a geometric construction for the harmonic 
conjugate n of a point m with respect to two points a, b. (This is defined as 
the point n such that [a,b,m,n] = —1.) One embeds the affine or projective 
line (a, 6 ) in a plane and draws the figure indicated by the construction in 
6.4.4. 

6.4.6. Geometric construction of morphisms arising from re¬ 
flections. Let £ be a vector space and S,T two subspaces such that 
£ = 5 ffi r is a direct sum. Let q : E S he the projection associated with 
this direct sum. The reflection through S and parallel to T is defined as the 
map / such that g(x) = (a: -f /(i))/2 for all x & E. 

Harmonic division gives a construction for the morphism g = f € GP(£) 
as follows: set V = p(S), W = p{T), and take m £ P[E). There exists a 
unique projective line D(m) going through m and intersecting V and W; g(rn) 
is the point of D[m) such that 

[m, g[m),D[m) n V, D(m) D W] = —1. 

In fact, D[m) arises from a two-dimensional vector space which is well- 
determined by X and q{x) (unless x = q{x), in which case Si(m) = m, ac¬ 
cording to 6.4.1). This plane also contains /(a:); taking {x,/(x)} as a basis 
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of it, we have 

p(x + /(x)) = p{2q{x)) = V n D{m), 
p(-x + f(x)) = p{2q{x)) = W n D(m). 
This gives [m, g(m),K n£)(m),W' n Z)(fn)] = —1 by 6.1.6. 



6.4.7. Note. A common case is when 5 is a hyperplane; then K is a point 
of P(E) and D[m) is exactly the line containing this point and m. We recall 
(cf. 4.5.19) that if if is algebraically closed every involution in GP(.£r) is a 
hyperplane reflection, and thus very simple from the geometric point of view. 

6.4.8. CROSS-RATIOS AND FIELD STRUCTURE. How does one read the 
field structure on a projective line? More precisely, given, say, five points 
a, b, c, d, e of a projective line, where is the sixth point / (resp. g) such that 
[tt, 6 , c, d] + (a, 6 ,c, ej = (a, 6 , c,/] (resp. |a, fe, c, d]|a, t, c, e] = [o, 6 , 0 , 3 ])? The 
fundamental theorem of affine geometry (2.6.3) would indicate that / and 
g can be found by constructions involving only intersections of lines; and in 
fact the proof of the theorem gives the answer (cf. 2.6.6.4). The construction 
displayed in figures 2. 6 . 6 .4.1 and 2. 6 .4.4.3 has one of the four original points 
at infinity; bringing it back to the affine plane we obtain figures 6.4.8. (This of 
course requires that D be inside an affine plane, but that’s necessary anyway, 
cf. 2 . 6 . 1 .) 




Figure 6.4,8 
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6.4.9. Maps that preserve harmonic division. Along the same lines, 
constructions 6.4.5 and 5.4.8 suggest that a bijection f : D —* D from a pro¬ 
jective line onto itself preserving the property of four points of being in har¬ 
monic division cannot be arbitrary, and possibly has to be a semimorphism. 
This is indeed the case; 

6.4.10. Proposition (von StauDT). Let D = P(E) be a projective line 
and f : D —* D a bijection. The property that for all distinct a,b,c,dBD with 
[a,6,c, d) = —1 the images f(a),f(b),f(c),f(d) also satisfy [f(a),f{b),f{c), 
f(d)] = — 1 ts equivalent to the existence of a semilinear map f ■. E —* E such 
that f — f. 

Proof. Let a : if —* A’ be an arbitrary automorphism of K, and let / be 
semilinear for o. Then 

but o-(—1) = — 1 because o is an automorphism. 

To prove the converse, identify K with D and assume, without loss of 
generality, that / fixes 0 ,1 and oo (since the composition f o g ol f with 
an appropriate element g of GP(A’) fixes those three points and preserves 
harmonic division if / does). In particular, / can be seen as a map f : K K. 
By 6.4.2, we have 

+ ^ J[^) + f{y) 

for all x,y & K, whence /(a:/2) = f(x)/2 for all x € K. Thus f(x + y) = 
f(x) + f(y) for all x,y € K. We now use 6.4.3: the points {l,x^,x,-x} are 
in harmonic division, hence so are {/(I) = 1,/(x^),/(x),/(—x) = —/(x)}; 
again by 6.4.3, this implies f(x^) = (/(a^))^- Applying this formula with x + y 
instead of i gives f(xy) = /(i)/(y), concluding the proof that / is a field 
automorphism. □ 

6.4.11. Corollary. A bijection of the real projective line preserving har¬ 
monic division is a homography. A continuous bijection of the complex pro¬ 
jective line preserving harmonic division is a homography or the complex con¬ 
jugate of one. 

Proof. This follows immediately from 2.6.4. See also 18.10.2.4. □ 

6.5. Cross-ratios and duality; applications 

What b a line of P(E*), that is, a “line of hyperplanes”? The answer 
comes from 2.4.8.1 (recall also 4.1.3.5): 

6.5.1. Definition and proposition. A projective line A of P(E^) is 
called a pencil of hyperplanes of E (a pencil of lines if dim(P(£7)) = 2, a 
pencil of planes if dim(P(£)) = 3.^ There exists a codimension-two subspace 
V of P[E) such that A ={ H € )i(E) \ H D V }. Conversely, for every 



6.5. CroBs-ratioB and duality; applications 


131 


codimension-two subspace V of P(E) the set \^H & )I{E) \ H D V } C P(E’‘) 
is a line. Moreover, for every x 6 P{E) \ V there exists a unique H & A 
containing x. □ 

The following is a simple result, but has numerous applications: 

6.5.2. Proposition. Let (ffi)i=i. 2 . 3.4 belong to a single pencil of hyper¬ 
planes of P(E), associated with the codimension-two subspace V. Assume 
the first three of these hyperplanes are distinct, and let D be a line of P(E) 
such that DnV = 0. Then, for any i, the set Df\Hi consists of a single point 
hi, and the two cross-ratios |hi] and are equal, where the first is consid¬ 
ered on the line D and the second on the line of P(E") which corresponds to 
the pencil of hyperplanes under consideration. 

Proof. This follows, for example, from the next result. □ 



6.5.3. Lemma. Let A be a pencil of hyperplanes, V the associated subpace 
and D a line which does not intersectV. Then the map A 3 H >-* HnD 6 D 
lies in Isom(A;Z?). 

Proof. Let H, K €. A he distinct, set a = D r\ H and b = D n K, and take 
6 £■* such that H = p[<j>~^[0)), K = p(\li"'*(0)). Normalize t/i so 
that <f[y) = —ip[x) = 1 for some x,y satisfying p(x) = o, p{y) = b. By 
construction, we have 

£>= {p(Ax+/iy) I (A,/z)e/f^\0}, 

The intersection with D of the element of A with coordinates (^, tj) is the 
point of D of coordinates {A,/i) given by 

(^<^ + r?t/>)(Ax + //y) = 0. 

This reduces to = tjA, so in P(K^) we have p(A,/i) = p(C, »j), and the 
desired map is in fact the identity in these coordinates. □ 
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Figure 6.5.3 

6.5.4. Corollary. The cross-ratio of the four intersection points of the Hi 

with D does not depend on D. □ 

6.5.5. Thales’ theorem (second proof) (cf. 2.5.1). The data are 

an affine space X and three parallel hyperplanes in X. In the 

projective completion X of X, one obtains four hyperplanes H, H', H", oox 
which belong to the same pencil. Thales’ theorem follows now from 6.5.2 and 

6.2.5. 

6.5.6. In the same vein, we can calculate the cross-ratio of four collinear 
points a, 6, c, d of an affine space using the construction shown in figure 6.5.6 
(after extending the line into a plane, if necessary). Draw a line a\b',c' 
parallel to (m, d); then 

I k 

c'b' 

This follows from an application of 6.5.4 to the two lines a,b,c and a',b',c', 
together with 6.2.5. 

One could thus build a purely affine, and, in fact, metric, elementary 
theory of cross-ratios. 

6.5.7. Polar of a point with respect to two lines. Consider 
in a projective plane two distinct lines D, D', and take x ^ D U D'. A 
line through x intersects D,D' at m,m'\ the harmonic conjugate of x with 
respect to m, m' describes a line through DnD’, namely, the line F satisfying 
[D,D',F,{x,DnD')] = -1 in P{E*). This line is called the polar (line) of x 
with respect to {D, D'}-, a reason for the name can be found in 14.5.6. Using 
this notion, it is easy to reprove 6.4.4, since the polar of a' (figure 6.5.7) with 
respect to {(a, d), (/3,fc)} is the same as with respect to {(a, fr), (/?, d)}, and 
thus goes through a and c. 

6.5.8. The following two dual results are proved using the same technique: 
let {a, fc,c, d} be distinct and collinear points, and similarly {a, 6', c', d'}. 
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Then the lines {b,b'), (c,c'), {d,d') are concurrent if and only if [a, 6 , c,d) = 
[a, 6', c', d'j. Let {D, E, F, G} be distinct lines in the same pencil of lines of a 
projective plane, and similarly for {D,E',F',G'}. Then the points EnE', 
F n F', G n G' are collinear if and only if \D, E, F, G) = |D, E', F', G'). 

6.5.9. Another way to present 6.5.8 is as follows: Let m,m' be points of a 
projective plane giving to the pencils A, A', respectively. Let / : A —» A' be 
a homography, / e lsom(A; A'), such that /({m, m')) = (m, m'). Then the 
set { Z> O / (D) I £) e A } consists of the line (m, m') union a second line S. 
(When /((m, m')) ^ (m, m') the set is a conic going through m and m'; see 
16.1.4.) We leave to the reader the statement of the dual result. 




6.5.10. Cross-ratios and projective coordinates. Consider a pro¬ 
jective base {m,}i=o.i.n + i Jind a an arbitrary point m. By 4.4.3, the asso¬ 

ciated projective coordinates [xi ,..., in + i) of ^ b* this base are only defined 
up to a scalar, that is, their ratios Xifxj determine m. The next proposition 
allows us to find these ratios geometrically: 
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6.5.11. Proposition. Given two indices i,j (i ^ j), and the hyperplanes 
Hi), H defined by 

Ho = 

the ratio Xi/xj is given by 

— = n Ho, {mi, my) n H]. 

Iv 



Proof. Fix a basis {et}j=i.„+i associated with the projective base being 

used (cf. 4.4.2). Then (A,/x) are coordinates for (mi, my) under the homog- 
raphy 

p(A,/i) — p(Xei,pej). 

We have (1,0) >—* p(e,) = m,, (0,1) p(ey) = my, 

( 1 , 1 ) I-* p(ei + Cj) = (m., my) n Ho, 

(xi, Xj) p(iie, + lycy) = (my, my) n H. 

This proves 6.5.11, in view of 6.1.6. □ 

6.6. Homographies of a projective line 

The group GP(A^) (or GP(£>), where /? is a projective line, the two 
groups being isomorphic), its subspaces and its elements have been studied 
in great depth, especially for if = R and C. They are very simple groups, 
related to a number of objects in analysis, arithmetic and differential geom¬ 
etry. Here we present only a few elementary results, the barest outline of 
the theory; on the other hand, we include numerous references, especially to 
work that in some way uses these groups. For a more systematic study, [GN, 
chapter IV) is an excellent reference. 
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6.6.1. Proposition. Let D be a projective line over an algebraically closed 
field K, and let J € GP(I?) be different from the identity. Then f has one or 
two fixed points. If a,b are distinct fixed points of f, the ratio [a, fc, m, f(m)] 
is constant for m ^ D \ {a,fc}, and lies in K \ {0,1}. Conversely, given 
k € K \ {0, 1 } and distinct points a, 6 € D, the formula [a, 5, m, /(m)] = k 
defines a homography of D with fixed points a, b. If f has a unique fixed point 
a, it is a translation of the affine line D \ {a}. 


6.6.2. Proof. By 4.5.17, we know that / has at least one fixed (or “double”) 
point. By 4.6.9, there can be at most two. Send one fixed point a € Z? to 
infinity; then the restriction of / to the affine line D\{a} is an isomorphism. 
Since D is one-dimensional, this restriction is a dilatation, and two cases are 

possible. K it is a homothety, let its center be b; we have bf[m) = kbm 
for all m 6 \ {a}, and 6.2.5 yields the desired conclusion. Otherwise the 

restriction of / is a translation and the proposition is proved. □ 

When there are two distinct fixed points, the constant value of the cross-ratio 
[a, 6 , m, /(m)] is given by the following lemma: 


6.6.3. Lemma. Put D = P(E), and let g € GL(£7) be such that f = g. If 
a,b D are distinct fixed points of D, let A and ft be the eigenvalues of g 
associated with the eigenvectors x,y £. E such that p(x) = a, p(y) = b. Then 
[a, b, m, /(m)] = A/p for any m €: D. 

Proof. Choose a projective frame (a, 6 ,m}, with a = p{et)) ^ = p{e 2 )> ^ = 
p(ci + ej). The matrix of g must be (o°), so f[m) = p(Aei -I- fte^), proving 
the lemma by 6 . 1 . 6 . □ 


6 . 6 .3.1. Even if the base field is not algebraically closed, 6 . 6.1 holds as long 
as there is one fixed point. It may happen that there are no fixed points, for 
example Zf = R, M(f) = (j This can be written, in the notation of 


5.2.4: 


ti—*—- (t ^ 0 ), oo I—♦ 0 , 0 oo. 

6.6.4. Image and object focal points. Whether or not there are 
fixed points, one can describe a homography as follows: write D — X \J oo, 
where X is an affine line (this is equivalent to choosing a point oo on D). 
The image (resp. object) focal point of / € GP(I?) is the point a = /~*(oo) 
(resp. 6 = /(oo)). Unless oo is fixed, we have a,b ^ X. By 6.1.4, 


(oo,a,m,nj= [b,oo,/(m),/(n)] 


for any m,n€: D. Applying 6.2.5 and 6.3.1 we obtain 


an bf (m) 

^ 67h’ 

that is. 


6.6.5 


am ■ bf[m) = constant. 
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This formula determines / if one knows the focal points and one pair (n, /(«))■ 
It also works when a — b. 

The name “focal points” comes from geometrical optics, where one proves 
that a centered optical system, under the Gauss approximation, associates to 
each point of the axis of the system (union infinity) another point of the axis, 
and that this correspondence is a homography. One can start by showing 
this for a single diopter or mirror, using classical formulas; the extension to 
arbitrary centered optical systems, which are by definition composed of a 
finite number of diopters or mirrors having the same axis, follows by taking 
the composition of the individual homographies (cf. 4.5.9). On the subject of 
geometrical optics, see, for instance, (A-B, chapter 4). 



Figure 6.6.5.1 




Figure 6.6.5.2 (Source: |BOU]) 


6 .6.6 . Geometric constructions. For geometric constructions on the 
homographies of a line, see 16.3.10.1. 

6.6.7. Notes. Here are some references on homographies in general and 
their applications in various areas of mathematics: |GN, chapter IV] is a gen¬ 
eral reference; for differential equations, see 6.8.12; for automorphic functions, 
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see [SA); hyperbolic geometry is discussed in chapter 19; [VL] has a pleasamt 
miscellany of geometric applications; 18.10.2.2 and |CH2, 183] discuss the 
homographies of P(C^) = S^. 

6.7. Involutions 


6.7.1. Definition. An involution of the projective line D is a komography 
f of D such that = Idc and f Idp. 

This differs slightly from the standard definition of an involutive map, which 
allows the case / = Id. Prom section 6 . 6 , we immediately get the 

6.7.2. Proposition. If an involution f has a fixed point, it has two fixed 
points a, b, and is determined by the condition [a, b, m, /(m)] = —1 for all m. 
Whether or not f has a fixed point, its restriction to the affine space D \ oo 
is characterized by 

am af(m) — constant, 

where a is the (image and object) focal point of f. In particular, if K is 
algebraically closed, f is always of the form [a, fc, m,/(m)] = — 1 ; if K = IR., 
either f has fixed points, or it can be written am ■ af(m) = —1 in some 
appropriate affine frame. 

The foUdwing statement collects some simple properties of involutions: 

6.7.3. Proposition. Every homography is the product of at most three 
involutions. A homography f is an involution if and only if there exists m 
such that f^(m) = m and f{m) ^ m. A homography with matrix A = (‘*^) 
is an involution if and only ifTrA — a + d — 0. 

Proof. By composing / with an involution if necessary, we can assume that 
/ has a fixed point, which can be taken to be oo. By 6.6.2, / is either a 
translation x i-* i + f or a homothety i t—» Ax of an affine line. In the first 
case, it is the composition of the involutions x h-♦ —x and x h-♦ t — i, and, in 
the second, of the involutions x i—♦ A/x and i i—* 1/x. 

If f^{m) = m and f(m) ^ m, we can take m = 0 and f(m) = oo, whence 
/(oo) = 0. By 5.2.4, we must have c yi 0, d = 0 and a = 0 , so / is of the form 
t I-+ b/ct, which is indeed an involution. 

Writing 

«(/)=(::), «(n 

we have /^ = A Id only if b(a + d) = c(a + d) = 0 and a^ + be = d^ + be = 1 . 
If a + d 0 we have b = e = Q and a = d, hence / = Id. □ 

6.7.4. Notes. The reader can easily check the following facts: an involution 
is determined by two pairs (a,/(a)) and {b,f(b)) (with a y^ 6 ); two distinct 
involutions coincide at exactly one point if iif is algebraically closed. These 
assertions will also follow from the corresponding geometric constructions, 
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given in 16.3.10.1. Involutions will be encountered again when we discuss 
Desargues’s theorem on pencils of conics (16.5.4). Finally, see [GN, chapter 
IV) or, if unavailable, 14.8.16. 

6.8. Exercises 

* 6.8.1. Let x,y,z,u,,v be five points on the same projective line. Show that 
the following always holds: 

[x, y, u, «)|y, z, u, v]\z, X, u, u] = 1. 

6.8.2. Show the theorems of Pappus and Desargues (5.4.1, 5.4.3) by using 
the statements in 6.5.8. 

ax + fc 

6.8.3. Consider a homography x h-* -- of the real line R. Study whether 

cx + a 

there is any relation between the existence of fixed points and the sign of the 

derivative ol x y-* -— 

CX+ d \ c ) 

6.8.4. Consult a book on geometrical optics (for instance, |A-B, chapter 4j) 
and study the nature of the homographies associated with: a (plane, convex 
spherical, concave spherical) mirror, a plane diopter, a thin lens, a thick lens. 
Study the reduced formulas given in the book. 

6.8.5. For if = R or C, study the behavior of the iterates /" (n 6 Z) of a 
homography of a projective line, according to the nature of /. 

6.8.6. Is every homography the product of two involutions for K = C? For 
/f = R? 

* 6.8.7. Let / be a homography with two distinct fixed points o, 6; show that 
the set {fc, l/k}, where k = [a, 6, m,/(m)] for every m, depends only on / 
and not on the choice of the order of a,b. If / has matrix M = (“ ^) show 
that {k, 1/k} are the roots of the equation 

(a5 - + 2M + + (a(5 - M) = «• 

* 6.8.8. The data are those of 6.8.7, and moreover K = C. We say that / is 

elliptic if the complex number k (or 1/k) has absolute value 1, hyperbolic if k 

is positive real, and hxodromic otherwise. Show that after normalizing M(f) 
by aS — P') = 1, we can characterize these three cases by using the trace t of 
/, given by f = a + 5 : 

{ elliptic •<==>■ a + (5 is real and |a + o| < 2, 

hyperbolic •<==>• a + 6 is real and |q! + S\>2, 

loxodromic <=> a + 5 is not real. 

Study, for the three cases considered, the nature of the iterates /"(n 6 Z). 
The word “loxodromic” is justified by the fact that the iterates /"(z) of a 
point z of the Riemann sphere C U oo all belong to the same loxodrome of 
the sphere (cf. 18.1.8.2 and 18.11.3). 
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6.8.9. Let / be a homography of C U oo, with matrix (“^). Show that a 
necessary and sufficient condition for / to fix the upper half plane H = {x € 
C I Im(z) > 0 } is that a, 6, c, d be real and that ad — be > 0. For such an /, 
study the nature of the fixed points and of the iterates of /, as a particular 
case of 6.8.8. Show that if commuting homographies / and g both leave H 
fixed, they have the same fixed points. 

6.8.10. In the notation of 6.3.2, show that the stabilizer of {4} is isomorphic 
to Sz and acts faithfully if /f ^ F^. 

6.8.11. Let ax^ + fcx® + cx^ 4- dx 4- e = 0 be a fourth-degree equation over C. 
Write the condition on a, h, c, d, e for the four roots of this equation to have 
cross-ratio —j or (cf. 6.3.2 (ii)). 

Find the condition for the four roots to be in harmonic division (messy). 
Find the six possible values of the cross-ratio in terms of a,b,c,d,e (very 
messy). Cf. [DX, 43-51]. 

+ 6.8.12. RICATTI differential equations. For a,b,c : [a, -♦ R contin¬ 

uous functions, consider the differential equation y'(t) = a(t)y^ + b(t)y + e{t), 
called a Ricatti equation. Show that if j/i (i = 1, 2, 3,4) are four solutions of 
this equation, the cross-ratio [vi(t)] is independent of t. 

6.8.13. Let R’ be a skew field, E a two-dimensional vector space over K, and 
a,b,c,d four distinct points of the projective line P(E). The cross-ratio of 
(o, 6, c, d), denoted by [ “ J)] , is the set of | € if such that there exbt u,v B E 
satisfying a = p(u), b = p[v), c = p(u 4- u), d = p(fw + v). Show that [" j] is 
the set of conjugates of a fixed element of the multiplicative group K* (under 
the action of inner automorphisms). State and prove a converse. Find out 
which results of this chapter involving cross-ratios still hold. 



6.8.14. Let K be a convex compact set of a finite-dimensional real affine 
space. Assume K has non-empty interior and all its boundary points are 
extremal. Define the function d : if x if —» R as follows: d(m, m) = 0 for all 
m B K and 

d(m,n) := -|log(|m,n,a,/9|)| 
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for m ^ n, where a, denote the two intersection points of the line m, n with 
the frontier of K. Show that d is well-defined and makes K into a metric 
space. Show that, for any m,n K, there exists a shortest path from m to 
n under this metric, and that it coincides with the segment [m, n] (cf. 9.9.5; 
see also 11.9.4). 

6.8.15. Study the homographies of a projective line over a skew field. 

6.8.16. Show that every homography of C U oo which leaves the unit disc 
X>={ 0 eC|| 2 r|<l} globally invariant is of the form 

ie z + zo 

z I—► e -r—, 

1 + zoz 

where 9 is real and |zo| < 1. Show that the group of such homographies is 2- 
transitive over 1) (cf. 1.4.5), and preserves the distance d of 6.8.14. Compare 
with 19.6.9. 


* 6.8.17. Let (ai),=;i & be points on a projective plane, so that the first 

four form a projective base. Denote by dij the line (ai,aj). Show that the 
following holds: 


(dl2) ^13i dlit di5]ld23,d2l,d24.d2b]\d31, d32, d34, dss] = 1. 

Show that a necessary and sufficient condition for the existence of a homog¬ 
raphy taking the (a,),=i.jj into new points (a| )t=i.s is that the following 

two equalities be satisfied: 

(di2i di3, di4, dis] = {d\2> d’i3, d^, dig] 


and 


[^231 ^ 2 I > ^24 I <^20 ) — 1 <^ 23 > ^21 > *^24 > *^ 251 > 


where we have put = (a', o' ). 



6.8.18. Study the notion of harmonic division in the non-commutative case, 
skew field What happens to 6.4.10? 

6.8.19. Why is there no point c in figure 6.4.8? 
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6.8.20. Ruled surfaces. Let 5 be a ruled surface in R^, that is, the 
union Ute/ ^ family D(t) of lines in R®, parametrized by t G /, where 

/ is an open interval of R. Assume that, at every point m of D(to) (to € /)i 
the surface 5 is a differentiable submanifold of R^, and denote by r(m) the 
tangent plane to S at m. Show that m i-+ T[m) is either a homography or a 
constant. Compare with 14.4.4. 

* 6.8.21. Let r be a tetrahedron in a three-dimensional projective space and 
let D be a line. Show that the cross-ratio of the four intersection points of 
D with the faces of T is equal to the cross-ratio of the four planes passing 
through D and the vertices of T. 

6.8.22. See exercise 14.8.16. 



Chapter 7 

Complexifications 


The aim of this chapter is to present an intrinsic definition of 
complexification for real vector, projective and affine spaces. We 
also study the problem of when morphisms and subspaces can 
be complexified, and finally (section 7.6) show that completing 
a real affine space into a projective spaee and then complexify¬ 
ing gives the same result as first complexifying and then taking 
the completion. This fact will later be used to introduce, in an 
intrinsic way, the cyclic points of a Euclidean affine space (9.5.5). 

The intrinsic process for complexification is lengthy, though 
not really difficult if one knows about the universal space X 
associated with an affine space X (chapters 3 and 5). Nonethe¬ 
less, the exposition is preceded by an introduction giving a much 
shorter, albeit non-intrinsic, construction. 
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In this chapter i denotes the point (0,1) of C. 


7.0. Introduction 

7.0.1. In this book we shall have to complexify several objects, for example, 
real vector spaces and their endomorphbms (to find eigenspaces, see 7.4.3), 
Euclidean spaces and the quadratic form defining them (to introduce isotropic 
vectors and characterize similarities, see 8.8.6.4), and projective spaces (to 
introduce cyclic points, see 9.5.5). The process of complexifying a real object 
is natural and fundamental in mathematics; as Hadamard said, the shortest 
path between two real results often goes through complex terrain. 

7.0.2. The simplest process, and one that would be sufficient for us, to 
complexify a real vector space E of finite dimension n, is the following: Take 
a basis for E, and embed E in C”' via the map 

E ^ X ~ (li ,. .., Xfi) ^ (^Cl ,. *., ^n) € C! , 

where x,- is the i-th coordinate of x with respect to the chosen basis. The same 
process works for a real affine space of dimension n: work with an affine frame, 
and consider in C" its natural affine structure (cf. 2.2.1). To complexify a 
Euclidean space and also the quadratic form defining it, we take a basis, write 
the quadratic form as ^ OijXiXj in this basis, and consider C" with the 
quadratic form St.yfor ( 2 i,...,z„) € C"; E is embedded in C" as 
above. To complexify a real projective space, we complexify the vector space 
from which is arises, and projectivize the result. 

The most complicated situation which we shall discuss is the following: X 
is an affine vector space, X its projective completion (cf. 5.1), and we desire 
to complexify both X and X, in such a way that the complex projective 
space obtained from X is the projective completion of the affine complex 
space obtained from X. This is easy to do using an affine frame of X: in 
the notation of 5.0.2, one starts by introducing the complexification C" of 
X and the embedding X —» C" above. The projective completion X of 
X is identified with P"(R), and the embedding X —♦ P'*(R) is given by 
(xi,...,x„.) I—* p(xi,..., x„, 1). But P”(R) can be naturally complexified 
into P'‘(C), with the obvious inclusion P"(R) C P"(C); thus, the embedding 
C" —» P"(C), given by (zj,... ,z„) i-* p(2i,..., z„, l), is indeed the same as 
the embedding of C", considered as a complex affine space, into its projective 
completion. In other words, we have a commutative diagram 

7.0.2.1 

C" —. P"(C), 

where the vertical arrows are complexifications and the horizontal ones pro¬ 
jective completions. 
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7.0.3. The observations above would be enough for the problems discussed 
in this book. Nonetheless, the constructions described in 7.0.2 have the fun¬ 
damental drawback of depending, a priori, on the choice of a basis. It might 
be argued that this flaw is merely esthetic, and that a concern with elegance 
and naturality is not essential; but such a concern certainly has been around 
for a long time, and, in the particular case of the constructions of 7.0.2, it 
gave rise to a long argument around the “Poncelet continuity principle”. This 
principle, as quoted in [GX, 72), goes as follows: “Consider a figure F formed 
by points, straight lines and curves, and suppose that F moves around, under¬ 
going a continuous deformation. Assume that, for a position F' of the figure, 
some of its elements have become imaginary; a property of F which does not 
involve these elements, but which may have been established by using such 
elements, will nonetheless apply to F' as well.” The principle transcends the 
framework of complexifications and in fact is connected with algebraic geom¬ 
etry, but we have quoted it because of its historical interest and its obvious 
relation to 7.0.2. 

Another quotation, from (DX, 15], is relevant here; it belongs to a period 
between the inception of the use of imaginary numbers in geometry and the 
appearance of today’s intrinsic complexifications. “Before we continue, it 
may perhaps be useful to respond to an objection that von Staudt addressed 
to the analytic theory of imaginary numbers, and which would apply equally 
well to the theory just presented. This learned geometer maintained that the 
existence of imaginary points, as we have defined them, somehow depends on 
the coordinate axes; and this objection would apply to any object studied by 
means of such points. To rebut this objection, it is sufficient to show that, 
if an imaginary point belongs to one or more surfaces, it will still belong to 
them after a change of coordinates. This is certainly the case if one agrees 
to apply to imaginary points and points at infinity the change of coordinate 
formulas that have been proved for real points.” 

7.0.4. A more sophisticated point of view would consist in using 7.0.2 but 
checking, whenever necessary, that the objects considered in the complexifi¬ 
cations depend only on the objects they come from in the real spaces, and not 
on the bases chosen to perform the complexification. For example, one can 
check, when complexifying a Euclidean vector space and its quadratic form, 
that the isotropic cone of the complexification depends only on the Euclidean 
structure. 

7.0.5. This whole discussion shows that an intrinsic formulation for the 
constructions in 7.0.2 is not at all out of place, which is why we develop it in 
this chapter. We do it tersely, however, skipping trivial demonstrations and 
some details in the statements, and, more important, defining only explicit 
canonical complexifications, and not complexifications in themselves, whose 
existence and uniqueness up to a canonical isomorphism would have to be 
proved. For this latter approach, admittedly more elegant but also more 
costly, the reader is referred to [FL, 144-155). 
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7.0.6. NOTB. The reader may have suspected that there exist generalizations 
of the notion of complexification; for an example, see [GN, chapter IV), which 
deals with quadratic extensions (more general than the extension C D R). 

7.1. Complexification of a real vector space 

Complexifying a real vector space consists in embedding it in the smallest 
possible complex vector space, like going from R to C. This is accomplished 
as follows: 

7.1.1. Definition. Let E be a real vector apace. The complexification of 
E, denoted by E^, la the product E x E, endowed with the complex vector 
structure defined by 

(x, y) + (x', y') = (x + x', y + y'), 

(A + y) = (Ax - py. Ay + px). 

The embedding E —* E^' is given by x t-* (x, 0). The involution 

a : (x,y) h-» (x,-y) 

from E^^ into itself is called the conjugation in E^. We have 
E = Ker(cr — Id^c) = { z € E^ ) ct(z) = z }• 

7.1.2. Remarks. 

7.1.2.1. The conjugation map is semilinear: cr(Xz) = Aa(z). 

7.1.2.2. Once we have 7.1.1, we can observe that t(y,0) = (0, y) — iy, so 
we can write (s, y) = x + ty and E^ = E ®iE (a real direct sum). Then 
ct(i + iy) = X — iy, which justifies the word conjugation. 

7.1.2.3. Other equally good (and equally gratuitous) definitions are: 

= iR(C; E) = {f :CE\ f is R-linear }, 

E'- = £■ ®R C. 

For a universal definition, see |FL, 146). 

7.1.3. Bases. Let {e,},=i. „ be a basis for E-, then {e,} C E'' is a basb 

for E'' (over C, not over R). Thus 7.0.2 is justified a posteriori. Conversely, 

if {e!i}i<=:i. n is an arbitrary basis for E'' (over C), it is a fundamental fact 

that the 2n vectors 

{e',+a(e:,)}u{j(e',-o(ai))| 

are in E, being invariant under a. Thus one can choose from among these 2n. 
vectors an R-basis for E. 
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7.2. Functoriality; complexification of morphisms 


7.2.1. Proposition. Let E,E',E" be real vector spaces, and take f € 
L{E\E'). There exists a unique 6 Lc(E^such that = /. 

Also (TO f^' = o(x, where a denotes conjugation in both E and E'. Finally, 
complexification is functorial, that is, (He)^ = Id£;f: and (go f)'' = o f ' 
for feL(E-,E'),geL(E'-,E"). 

Proof. It is enough to observe that, for any x,y B E, 

f^'(x + iy) = f[x) + if{y) 


(cf. 7.1.2.2), and that the diagram below commutes: 


E^' ^ 

E>c 


1 " 

1 " 

□ 

E*' 

E"^. 



7.2.2. If M(f) is the matrix of / with respect to a pair of bases of E,E', 
then M{f'^) = M{f) with respect to the same bases, considered as bases of 
E'\ E''^' (cf. 7.1.3). 


7.3. Complexification of polynomials 


7.3.1. Proposition. Let <f>: Ex-xE—^lBubea k-linear map over a real 
vector space E. There exists a unique k-linear map 

<i,^' : E'^ X ■ X E'^ C 

(over C) such that 4 >^''\ex - xE — 4>- Moreover, if (f is symmetric, so is . 
Proof. The value of (fP must necessarily be 

7.3.2 (xi + iy „..., xfc + iyfc) = 4'[X ^,.... X^), 

p 

where P ranges through the 2*' subsets of {1,2,..., fc), j(P is the cardinality 
of P, and is equal to x, if s € P and to y, if s ^ P. It is easy to check 
that 4>‘' satisfies the stated properties. □ 

7.3.3. Example. For it = 2, 

4>'-^(x + iy, x' + ly') = ^(x, x') - <^(y, y') + t(^(i, y') + ^(x', y)). 

7.3.4. Proposition. Let f e Pk[E). There exists a unique f ' e Pk(E'’) 
such that f^'ls = /■ 

Proof. By 3.3.1, we can assume that / is homogeneous of degree k. Let f = 
A, where ^ —> Rbe a symmetric multilinear map. Put /^' = <f>' oA^"", 

where is the map given in 7.3.1 and A*^' : E^ —» E^ x ■ ■ ■ x E'^ is the 
diagonal map. Then satisfies the required properties, and is the only map 
to do so by the uniqueness parts of 3.3.1 and 7.3.1. □ 
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7.3.5. Fixing a basis for E and working in coordinates, one can easily obtain 
by taking formulas 3.3.3 and replacing the real numbers by arbitrary 
complex numbers. 

7.4. Subspaces and compiexifications 

7.4.1. Proposition. Let E be a real vector space, F a vector subspace of 
E. The C-vector subspace of E^ spanned bp the subset F C E'^, denoted 
by F'', is equal to F'‘ — F + iF — {a: + ty | x,y e F}, and is called 
the complexification of F (in E''); it is invariant under a. Conversely, if 
S C is a (complex) vector subspace of E'' such that a(S) = S', the 
intersection S n E is a (real) vector subspaee of E, and its real dimension 
dimR(S n E) is the same as the complex dimension dime S of S. Moreover, 
one has S = (S n E)^'. 

S 


0 

SHE 

Figure 7.4.1 

Proof. The non-trivial part is the second half. It is clear that S fl F b a 
vector subspace of F, and that (SnF)^’ C S; to prove equality from the fact 
that <t(S) = S, we use the same idea as in 7.1.3: for every x € S, the vectors 
X + cr(x) and (l/i)(x — cr(x)) are in S n F. Then 

(x + a(x)) + t Q(x-<T(x))^^ € (SnF) + i(SnF) = (SnF)''. □ 

7.4.2. Notes. It is difficult to illustrate this proposition with a drawing, 
since the first non-trivial case requires dime S = 1, dime F = 2, hence 
dimR F = 4. Notice that 7.4.1 is false if cr(S) / S; take, for example, F = R^, 
E'' = C2, S = { {z,iz) I « e C }. Then S nR^ = 0, so (S n F)'- = 0. 

Observe how we have been systematically using a (cf. 7.1.1, 7.2.1, 7.4.1); 
this shows we have to carry around the conjugation a whenever we discuss 
the space E'\ It can, in fact, be proved that giving E'^ and <x is equivalent 
to giving F and F'^; cf. [FL, 146], where this is the basis for the definition of 
a complexification. 

7.4.3. Corollary. Let f g L[E\E) be an endomorphism of a finite¬ 
dimensional real vector space. Then f admits a one- or two-dimensional 
eigenspace P. 
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Proof. Let K be a one-dimensional eigenspace of for the endomorphism 
with eigenvalue A. Then (t(V) is a one-dimensional eigenspace, with 
eigenvalue A (cf. 7.1.2.1). Thus the complex vector subspace S' = K + <r(V) 
is invariant under cr, and has dimension one or two. The desired subspace is 
now E n S. □ 

7.4.4. This result, the real version of the theorem of existence of eigenvectors 
for complex endomorphisms, is very useful (cf. 8.2.15, 13.5). 

7.5. Complexification of a projective space 

7.5.1. Definition. Let P = P{E) be a real projective space. The complexifi¬ 

cation of P, denoted by P'^', is the complex projective space P'' = P[E'^). 
We identify P with a subset of P'^ by taking the quotient of the inclusion 
E\0 E'' \ 0. The map P'^ —* P*' obtained from a : E‘^ \ 0 —» E'^ \ 0 

by passing to the quotient is also denoted by a, and called the conjugation in 

P'\ 

7.5.2. Observe that P is not a projective subspace of any more th 2 tn 
E is a. (complex) vector subspace of E^. Observe also that tr : P^' —> P^ 
takes collinear points into collinear points, and is, in fact, a semimorphism 
(cf. 5.4.8). Finally, P has the same real dimension as the complex dimension 
of P'', just like E and £7'^. 

7.5.8. Proposition. Let E be a vector space. Given f e GP(.E), there 
exists a unique € GP(£'^') such that f‘^\p^B) = /• D 

7.5.4. Proposition. Let P be a real projective space, S C P a projective 

subspace. The (complex) projective subspace of P^' spanned by the subset S 
of P'', denoted by , satisfies cr(5^'') = S'', S = S'' n P, and is called the 
complexification of S (in P''). Conversely, let T be a projective subspace of 
P'' such that cr(r) = T. Then TnP is a (real) projective subspace of P whose 
real dimension is the same as the complex dimension of T. Furthermore, 
\Tr\P)''=T. □ 

7.5.5. Calculations are performed using homogeneous coordinates associated 
with a basis of £7 or a projective base of P[E), which remain, respectively, 
a basis of E'' and a (complex) projective base of P[E). It is enough to use 
complex coordinates instead of real ones. Again, this furnishes an a posteriori 
justification for 7.0.2. 

7.6. Complexification of an affine space 


7.6.1. Let [X,X) be a real affine space. Embed X in its universal vector 
space X (cf. 3.1.7), and consider the inclusions X C ^ C , where X'' 
denotes the complexification of the real vector space X. Recall (cf. 3.1.7) 
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that X = where M is a linear form over X. By 7.2.1, there exists a 

well-defined linear form X*^ —* C, the complexification of M ; X —» R, which 
we denote by M^'. Finally, put 

X'^ = C X^. 

Then X'' is a (complex) affine hyperplane of the complex affine space 
considered with its natural affine structure (cf. 2.2.1). In particular, X''"’ is a 
complex affine space. 

7.6.2. Definition and proposition. The complexiBca-tion of the real 
affine space X is, by definition, the complex affine space X'' = (M'^)“*(l). 

The vector space underlying X'' is X'^' = X^, and we have a natural inclusion 
X C X*"’ . The conjugation a of leaves X^’ invariant, and we have X = 
{ z & X'-" I o(z) = z }. The restriction of a to X^', still denoted by cr, is 
called the conjugation of X''. Vectorializing at an arbitrary a & X, we have 
{X‘")a = (XaY^ and a natural isomorphism X'-'' = X^'. 

Proof. The vector subspace underlying X'' is (M‘''')~^(0), which is indeed 
the same as X'' because Af~*(0) = X. Moreover, 

a[r') = o((A/'-r‘(l)) = 

by 7.2.1. On the other hand, for x € a[z) = z is equivalent to z £ 
X'-' n X = X. The equality (X^')a — (Xa)^^' follows from the last part of 
3.1.7. As for the isomorphism = X^, use the definition of universal 
spaces. □ 

7.6.3. Proposition. Let X,X' be real affine spaces and f £ A(X;X') a 
morphism. There exists a unique f^ £ Ac(X‘^;X'*^') such that f^\x — /• 
This IS called the comp/exification of f. We have a o f^ = f*^ o a. 

Proof. Apply 7.2.1 to the vectorializations X„ and X^(^|, and use the last 
result in 7.6.2. □ 

7.6.4. For X a real affine space and S a subspace of X, one has a result 
analogous to 7.4.1, namely, = S''. 

7.6.5. Similarly, one can extend 7.3 to polynomials on X; it suffices to apply 
the criterion given in 3.3.14, together with 7.6.2 and 7.3.4. 

7.6.6. We can now discuss 7.0.2 as a whole, especially diagram 7.0.2.1, which 
is the central point of this chapter, and will be used in 9.5.5 and chapter 17. 
Let X be a real affine space and X its projective completion (cf. chapter 5): 

X = P(X) = X U P(.^) = X U oox. 

Introduce the affine spaces X'', X'^, and the complex projective space (X)'' = 
PjX''), the complexification of the real projective space P(X) = X. Inside 
(X)'-' is embedded the complexification oo^ f the real subspace oox (cf. 7.5.4). 
The inclusion X'' C P(X^'') = X^" gives the following result: 
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7.6.7. Lemma. X‘' ^ .Y^\oo^. 

Proof. We have oox = P[X), thus oo$ = and 3^ = X'^ (cf. 7.6.4). 

□ 

7.6.8. Proposition. The inclusion X^ C P{X^') = X'^ is exactly the nat¬ 
ural inclusion of X'^ into its (complex) projective completion. In particular, 
X '-□ 

7.6.9. An application of 5.2.2 shows that, for any / € GA.(X), there exists a 

unique f^ € GPjjc:(.X'‘') such that = /*■'• 

7.6.10. One can state similar results about subspaces. 

7.6.11. Proposition 7.6.8 justifies a posteriori the constructions and the 
diagram in 7.0.2. 

7.7. Exercises 

7.7.1. State and prove a converse for the formula o o f^‘ = f^ o a (7.2.1). 

7.7.2. Given / € L{E\E'), give conditions for the existence of an extension 
f ; E'^ —* E'' ol f such that / is semilinear with respect to conjugation in 
C (cf. 2.6.2). 

7.7.3. How can 7.5.3 be extended to arbitrary morphisms from P{E) into 

P(F)? 

7.7.4. Fill in the details of 7.6.4, 7.6.5 and 7.6.10. 

* 7.7.5. Show that the tensor product E®rC b a natural complexification 
of E, and so is HomR(C; E). 



Chapter 8 

Euclidean vector spaces 


Our universe is very well approximated by a Euclidean affine 
space, or, more precisely, by a Euclidean space up to a multi¬ 
plicative scalar, since there is no natural unit of length. Euclid¬ 
ean affine spaces are the main framework in this book, and we 
start studying them in chapter 9. Here we introduce this study 
by discussing Euclidean vector spaces, which underlie Euclidean 
affine spaces. 
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The treatment is largely algebraic, both in appearance and 
in content. Also, one must make a choice of what topics to cover, 
since the amount of material accumulated since the time of the 
Greeks is immense; in the interest of coherence, we have chosen 
a unifying thread, the orthogonal group O(-), and built around 
this (see a plan of study in 8.2.14). 

After introducing the necessary definitions and discussing 
some generalities, in sections 8.1 and 8.2, we study the structure 
of individual elements of 0{E) and their eigenspaces (section 8.4) 
and the structure of 0{E) as a whole (sections 8.5, 8.3, 8.9). A 
detailed analysis is made possible in dimension two by the use 
of complex numbers (section 8.3), and in dimensions three and 
four by the use of quaternions (section 8.9). Section 8.10 is in¬ 
formational only, and is meant to bridge the gap between the 
basic material and current research. Section 8.7 is quite tech¬ 
nical, dealing with oriented angles, a pretty fine invariant in 
the two-dimensional case which allows one to obtain nice results 
about circles (for instance, 10.9.7). The coarser notion of (non- 
oriented) angles between half-lines and between lines is more 
fundamental, and is developed in 8.6. Section 8.8, on similari¬ 
ties, is motivated, among other things, by the already mentioned 
fact that our physical universe is only Euclidean up to a scalar. 
Finally, section 8.11 introduces the classical cross product (use¬ 
ful in many areas) and the canonical volume form of a Euclidean 
vector space, which will be used to define the canonical measure 
of a Euclidean affine space. 

If you are familiar with the way mathematicians have of gen¬ 
eralizing everything, you will have suspected that one can gen¬ 
eralize the notion of Euclidean vector spaces, by leaving aside 
some of their properties. We only discuss such generalizations 
very superficially (chapter 13); the interested reader can consult 
[DEI], [PO], [S-T], [BI2|, [AN]. 
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Unless we mention otherwise, E is a. Euclidean vector space, whose dimen¬ 
sion we denote by n. 


8.1. Definition and basic properties 

8.1.1. Definition. A Euclidean vector space E is a finite-dimensional 
vector space of R, together with a positive definite symmetric bilinear form 
4> (i-e., 4> : E X E —* Ti is symmetric and bilinear, and <f>[x,x) > 0 for all 
X ^ 0). We write 4>{x,y) = (x | y), and call this num ber the scal ar prod uct 
of X and y. The norm of x is, by definition, ||x{| = \/<j>{x, x) = y^(x | x). If 

(x I y) = 0, we say that x and y are orthogonal. A set {ei},=i.fc C E is 

called orthogonal if (e,- | ey) = 0 for all pairs i ^ j, and orthonormal if it is 
orthogonal and ||ei|| = 1 for all i. 

8.1.2. Notes. 

8. 1.2.1. A vector subspace F of a Euclidean vector space E has a natural 
Euclidean structure given by the symmetric bilinear form <f>\F. 

8.1.2.2. The standard example of a Euclidean vector space is E = R", with 

n 

^((xi,...,x„),(y,,...,y„)) = X^Xjy<. 

i = l 

8.1.2.3. The map ^ is a quadratic form on E (cf. 3.3.2), and one has, for 
every x, y € E, 

[||x + yf = ||xf+ 2(x|y) + ||yf 
8.1.2.4 { 

i(x|y)=i(||x + yf-l|x|P-||y!P). 

8.1.2.5. An orthogonal set all of whose vectors are non-zero is linearly inde¬ 
pendent. 

8.1.2.6. If {e, } is an orthonormal basis for E, the coefficients of the decom¬ 
position I = XiCi are given by x, = (x | e^). Moreover, 

J * 

8.1.3. Proposition. For every x,y e E, we have |(x I y)| < ||x||||y||. 

Equality holds if and only if x and y are linearly dependent. For every x, y € 
E, we have ||x + y|| < ||x|| + ||y||; in particular, the map d : E x E "R. 
defined by d[x,y) = ||x —y|| is a metric, and generates the canonical topology 
on E (cf. 2.7.1). □ 
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Figure 8.1.3 


8.1.4. Proposition (Gram-Schmidt orthonormalization). Let 

be a linearly independent set in E. There exists an orthonor¬ 
mal set {ai}i=i....,fc with the following properties: .k is homotopic 

to . k (cf. 2 .7. 2 . 7) and, for every h = 1 , the span o/{a,},=i h 

is equal to the span of . h- Iv. particular, given a (possibly empty) or¬ 

thonormal set in E, one can always complete it into an orthonormal basis, 
and every basis is homotopic to an orthonormal one. 



Figure 8.1.4 


Proof. Use recurrence over k. For k = I, take Oi = fcf = || 6 i||~* 0 i; here we 
have introduced the notation x” = ||i||~^x, that is, x” is the vector obtained 
by normalizing x. It is easy to see that is homotopic to x: the homotopy 
can be taken as the segment from x to x", namely, x(t) = tx" -f (1 — t)x for 

t e [0, 1 ], 

Now assume we have found {oi},=i. h-i with the desired properties. 

Put 

h-l 

^ ^ (flt I bk^ai, Cf, bh d/,, ah c^. 

i=l 

One verifies immediately that {a,},=i. h satisfies our conditions, the hoino- 

topy being taken as a composition of which the two last steps lead from bh 
to Ch via bh(t) = tch + (1 — h)bh, and from ch to ah as above. □ 
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8.1.5. Proposition. Let E, Ef be Euclidean spaces of same dimension, 
and f : E —* E' a map. The following conditions are equivalent; 

i) / e L{E-,E') and |l/(x)|| = ||x|| for all x € E; 

“) (/(^) I /(y)) = i y) for x,yeE. 

Such a map is necessarily bijective and is called an isometry. The set of all 
such is denoted by 0(E]E'). 

Proof, (i) =>■ (ii) follows from linearity and 8.1.2.4. (ii) => (i); take am or¬ 
thonormal basis {ei} for E. By (ii) and the fact that E, E' have same dimen¬ 
sion, {/(ei)} is an orthonormal basis for E'; now 8.1.2.6 and (ii) show that 
/(E. 2 :.e 0 = Y^iXif{ei), so that / is linear. □ 

8.1.6. Corollary. Every n-dimensional Euclidean vector space is isomet¬ 
ric to R" (cf. 8.1.2.2). □ 

8.1.7. Some take advantage of 8.1.6 to talk about the Euclidean space of 
dimension n. This is not a crime, as long as one knows what one’s doing. 

8.1.8. Duality. 

8.1.8.1. Lemma. The map b : E B x y-* = {j/ 1 -* (x | y)} 6 E*, where E 

is a Euclidean vector space and E* its dual as a real vector space, is a vector 
space isomorphism. The inverse map is denoted by ^ : E* —* E. 

Proof. We know that dimE = dimE* (by considering dual bases, for exam¬ 
ple); since b is linear, we only have to check that its kernel is zero. But if 

(x I y) = 0 for all y, in particular (x (i) = )jx|p = 0, so x = 0. □ 

8.1.8.2. Lemma 8.1.8.1 gives a natural Euclidean structure to E*, and we 
will consider E* with this structure when necessary. 

The next result follows from 2.4.8.1 and 8.1.8.1: 

8.1.8.3. Definition and proposition. If A C E is a non-empty subset, the 
set 

A-^ = {xe£'|(x|y) = 0 for all y & A }. 

is a vector subspace of E, called the orthogonal subspace to A. If (A) is the 
span of A, we have i4-‘' = {A)-^. If A is a subspace, we have a direct sum 

A © = E, and dimi4 -f dimi4-‘- = dim.E, A-^-^ = A. If A, B are two 

subspaces, we have (A-i- B)'^ = A'^nB'^, (i4nR)'^ = A"^ Two subsets 

A,B of E are called orthogonal if A (Z B'^, or, equivalently, B C A"*"; in this 
case we write A J. B. 

8.1.8.4. Notation. ffB=K®Wisa direct sum such that W = K-*-, we 
talk about an orthogonal direct sum, and write E = V ®-‘- W. More generally, 
one can have an orthogonal direct sum of a family of subspaces: 
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8.1.8. 5. In the particular case dim£ = 3, 8.1.8.3 recovers results from 
Euclidean geometry which are otherwise taken as axioms, for instance; the 
set of lines perpendicular to a given line at a point forms a plane. As you may 
have noticed, this result is quite important in practice: it is thanks to it that 
doors can open and close without theoretical difficulty. Another corollary: 
if two lines D, D' are respectively perpendicular to planes P, P' at the same 
point, the plane spanned by D,D' is perpendicular to the line P D P'. 





Figure 8.1.8.5 


8.1.8. 6. Lemma 8.1.8.1 allows one to define the adjoint endomorphism to 
f : E E, denoted by ‘/. To do this, consider the transpose /* : E'’ —> E ', 
and set 

* / ='(i ° f* E —* E. 

The adjoint satisfies 

(‘/(s:) |y) = (2 1 /(«/)) 


for all x,y £ E. 
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8.2. The orthogonal group: first properties and plan 
of attack 

8.2.1. Proposition. The group 0(E) = 0[E;E) is called the orthogonal 

group of E; we write 0{n) = The condition f e 0(E] is equivalent to 

*AA = I, where A is the matrix of f in some (or any) orthonormal basis, *A is 
the transpose of A and I is the identity matrix. In particular, det/ = ±1; we 
set 0+[E) ^ [f e 0(E) I det / = 1} and 0-[E) = { / 6 0[E) | det / = 
— 1 }. The elements of O'^ {E) are called rotations. Finally, f 6 0(E) if and 
only if *// = We (cf. 8.1.8.6). 

This follows from the lemma below, which is a consequence of 8.1.2.6 and the 
definition of the product of two matrices; 

8.2.2. Lemma. Let{ei} be an orthonormal basis for E, andA,B two square 
matrices with rank equal to the dimension of E. Then, letting (resp. pk) 
be the k~th column vector of A (resp. B), we have 

^AB = ((xi I pj)). □ 

8.2.3. Remarks 

8.2.3.1. If n is the dimension of E, we have an isomorphism 0(E) S 0(n) 
by 8.1.6. Thus the study of orthogonal groups (at least as far as the linear 
group structure is concerned) can be restricted to the C>(n). 

8.2.3.2. The subgroup 0'^[E) is normal in 0(B). We have the obvious 
multiplication rules: 

0-(E)0-{E) C 0"^(E), 0-{E)0^{E) c 0-(E). 

8.2.3.3. In the topology induced from GL(£:) (cf. 2.7.1), 0[E) is compact. 
In fact, take an arbitrary basis for the finite-dimensional vector space E. If we 

denote by A(f) the matrix of / in this basis, the map / (Tr(M(/)j4(/))) " 

is the norm of a Euclidean structure over GL(£:). If, in addition, E is Euclid¬ 
ean and the basis taken is orthonormal, all elements of 0[E) have norm equal 
to n = dim E when expressed in this basis, by 8.2.1, and 0{E) is bounded in 
GL(£'). Finally 0(E) is closed since it is defined by the condition ^AA = I. 

Another way of seeing that 0(E) is bounded is to assign elements of 
GL(£:) their norm as linear maps from the Euclidean space E into itself (watch 
out, the norm thus obtained in not Euclidean!). We have, for / € 0(E), 

ll/ll = sup { ||/(x)|| I X e £■ and ||x|| = 1 } = 1, 

by the definition of 0[E), 

8.2.4. The fact that 0{E) C GL(E) is compact has the following very useful 
converse: 
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8.2.5. Theorem. Let G be a compact subgroup of GL(£), where E is a 
finite-dimensional real vector space. There exists on E at least one Euclidean 
structure 4> such that G C 0{E), where 0[E) is the orthogonal group for this 
Euclidean structure. 


8.2.5.1. First proof. The shortest proof uses the existence of the Haar 
measure on a compact group (see, for example, [DE14, volume II, p. 225 ff.]). 
Let dg be the Haar measure on G, and let 4> be an arbitrary Euclidean 
structure over E. For g €: G, define a new Euclidean structure by 
(g*^)(x, p) = ^(g(x),g(y)) for every x,y & E, and then take the average 
of the g" <f> for the measure dg: 


4 ,= 



g‘4>dg, 


that is, 

4>{9(x),g{y)) dg 

Ja€G 

for all X, y. It is obvious that 4> is still a Euclidean structure. Further, if is 
invariant under G because dg is invariant under translations of G. But saying 
that g* (j) — 4 for all g € (7 is the same as saying that G is contained in the 
orthogonal group of (E,<f>). □ 

8.2.5.2. Second proof. The second demonstration uses only Lebesgue 
measures on finite-dimensional real vector spaces, via 2.7.5.7. Consider the 
real vector space P^iEI) of quadratic forms over the n-dimensional real vector 
space E\ this has dimension n(n-l-l)/2. In P^iE), the subset Q[E) of positive 
definite quadratic forms is an open convex cone, invariant under the action 
of G defined in 8.2.5.1. Starting from an arbitrary 4> in Q(E), the orbit 
K — { g*4> I g € G} of ^ under G is compact in P^iE), and is contained in 
Q{E). In particular, its center cent(/f) is in Q{E) and is fixed under G by 
2.7.5.7. □ 



Figure 8.2.5.2 
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8.2.5.3. Third proof. We shall see in 11.8.10.8 a geometric proof for 8.2.5, 
based on convexity and not using measure theory at all. □ 

8.2.6. Remarks. If G is finite, G = { g, 1 1 6 7 }, just take 4> = Yli 9i4>- 

It is generally not true that there is only one Euclidean structure in¬ 
variant under G; suffice it to take G containing only the identity! But the 
following nice criterion holds: G has a unique invariant Euclidean structure 
(up to a scalar) if and only if G is irreducible in GL(E), that is, there is no 
subspace of E (other than {0} and E itself) left invariant by every g & G. 
See 8.12.1. 

For recent applications of 8.2.5, see, for example, jPA), [BOS, chapter 
VIII], and 8.12.1. 

8.2.7. Proposition. Assume dimE > 2. The group 0'^[E) acts transi¬ 

tively on the unit sphere S(E) of E, given by S{E) = {x 6 E | ||xj| = 1 }. 
It also acts transitively on the set of p-dimensional grassmannians G^.p of 
E (0 < p < dimE). The group 0(E) acts simply transitively on orthonor¬ 
mal bases of E. The group O'*" (E) acts simply transitively on homotopic 
orthonormal bases (cf. 2.7.2.7). □ 

8.2.8. Proposition 8.2.7 allows us to write the standard n-dimensional unit 
sphere 5” = 5(R”‘*'*) as the homogeneous space 

5" = 0(n-f-l)/0(n) 

(cf. 1.5.5), where 0(n) is naturally embedded in 0(n ■+ 1) as the subgroup of 
0(n + l) formed by maps / leaving the vector (0,... ,0,1) e invariant. 

Similarly, the grassmannians can be expressed as a homogeneous space 

G„,„ = 0(n)/(0(p)x0(n-p)), 

where 0(p) is the subgroup of 0{n) consisting of maps / leaving invariant 
the last n — p vectors .. ., e„ of the canonical basis of R", and 0(n — p) 
leaves invariant ei,...,e,,. 

8.2.9. Proposition. Let E = S ®T be a direct sum decomposition of 

E, and let a be the reflection through S and parallel to T (cf. 6.4.6). Then 
a G G(£') if and only if E = S T (or, equivalently, T = ); in this case 

a is denoted by cry, and is called an (orthogonal) reflection (or symmetry) 
of the Euclidean space E. Every reflection is an involution, and, conversely, 
every f G 0[E) such that f^ = Id^ is a reflection. We have as € O'^ (E) if 
dim S 'is even, and as 6 0~(E) if dim S-^ is odd. □ 


8.2.10. It is useful to write down the explicit formula for an in the case when 
77 is a hyperplane. Fix a non-zero vector x in (which has dimension l); 
then 


(^H{y) = y-2 


(y 1^:) 
l|x|P • 
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8.2.11. For x,y €. E with ||x|| = ||y||, there exists a hyperplane H such that 
<^h(x) = y, and H is unique if x ^ y, x / 0. If x = y, every hyperplane 
containing x works, and if not (x — y)-*- is the only hyperplane that works. 
Observe that H is just the hyperplane equidistant from x and y, defined 
(cf. 9.7.5) as /f = { z e £ I d(x, z) = d(y, z) }. 

8.2.12. Theorem. Every f e 0[E) is the product of at most n = dim.E 
hyperplane reflections. 

Proof. It is enough to use recurrence: by 8.2.11, one can reduce / to a map 
fixing X 0, and this map leaves invariant 5 = x"*", which has dimension 
strictly less than E. □ 

There is good reason to call 8.2.12 a theorem, in spite of the simplicity of its 
proof, because it is a fundamental result with many applications. If, instead 
of a Euclidean structure, we consider an arbitrary non-degenerate quadratic 
form on a finite-dimensional vector space over a commutative field, the result 
is still true, but its proof is much more difficult, and is, in fact, one of the 
central objectives of chapter 13. 

8.2.13. Corollary. If dim E = l, we have 0(E) = IdijU-Idij. If 
dim E = 2, every f € 0~(E) is a reflection through a line, and every f £ 

(E) is the product of two such reflections. □ 


D 



Figure 8.2.13 
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8.2.14. Plan of study for 0(E). 

Structure of an element of 0[E). In 8.4.5 and 8.4.6 we study the minimal 
number of reflections through hyperplanes (resp. codimension-two subspaces) 
which make up an arbitrary / € 0{E) (resp. 0'^[E)). In 8.2.15, we decom¬ 
pose / € 0{E) into a product of isometries of one- or two-dimensional sub¬ 
spaces, and thb leads to a study of 0(2) (since 0(1) is trivial, cf. 8.2.13). 
The essential fact here b that O'** (2) b abelian. 

Global structure of 0(E). Besides showing that 0+ (2) b abelian and studying 
the center of 0(E) and O'*'(E) (easy), we show in 8.5 that 0+(n) is simple 
for n = 3 and n > 5. The exceptional 0(4), which b indeed non-simple, 
b studied in 8.9.10 with the help of quaternions. Quaternions are a fine 
instrument in the study of 0(3) and 0(4). 

Topology and algebraic topology of 0{E). We shall see in 8.4.3 that 0'*'(E) 
is path-connected, and thus that O'*'(E) and 0~ (E) are the two connected 
components of 0[E). In 8.10.3 we find that the fundamental group ?ri(0(n)) 
is Z for n = 2 and Z 2 for n > 3. These are fundamental facts in mathematics; 
xi(0(2)) = Z is a cornerstone of complex analysis and geometry of plane 
curves, and 7ri(0(n)) = Z 2 has been the source of unexpected developments 
(cf. 8.10.3). 

8.2.15. Lemma. Let f € 0(E). There exists an orthogonal direct sum 
E = 0^ Pi such that f(Pi) = P< and dimPj = 1 or 2 for i^l i. 

Proof. By 7.4.3 there exists a one- or two-dimensional subspace Pi C E such 
that /(P,) = P,. □ 

8.2.16. The center of 0{E) and of 0-*[E). Assume that dimfT > 3 
(the two-dimensional case is studied in the next section) and that / 6 0[E) 
Commutes with every g € O'* [E). Taking g — ay to be a codimension-two 
reflection, we have f(V) = V because g fixed every point of K. Now /(P) = V 
for every codimension-two subspace V implies f[D) = D for every vector line 
D, so / is a homothety, / = Ald^ (cf. 4.5.3). Since / G 0{E), we have 
A = ±1 and / = db Id£;. Thus: 

8.2.17. Proposition. The center of 0[E) is {Id^i ~ Id^}- The center of 
0'*'{E) is {Ids} if dim E is even, and {Id^, - Id^} if dim £ is odd. □ 

8.3. The two-dimensional case 


In this section P is a Euclidean plane, non-oriented unless we say otherwise. 


8.3.1. Lemma. Fixing an orthonormal basis for E and letting M{f) be the 
matrix of f with respect to this basis, we have 
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i) / eO+(£)<=> M(/) = and = 1, 

n) / eO+(£:)<=> M(/) = ^“ aR(ia2 + 62 = i. 

Furthermore, for a fixed f € 0'*'{E), the real numbers a and |&{ do not depend 
on the basis. 

Proof. Let B = {«!, 62 } be the basis, and put M(f) = (JJ ^), that is, /(cj) = 
(o, fc) and /(ej) = (c, d). By assumption, /(e2) S t)ut we have 

dim(/(ei))'*' = 1 and (—6,a) € (/(ei))"*^. Thus /(e 2 ) = (—kb,ka) for some 
A; € R; but ||/(e 2 )|| = ||/(ei)|| = 1 implies = 1, so A; = ±1. Since 

det / = k(a^ + fc^), the case k = —1 corresponds to / € 0~(E), and the case 
A; = 1 corresponds to f £ 0'*'{E). Either way the scalar a is independent of 
the basis because Tr/ = 2a, and (6| because = 1 — a^. □ 


8.3.2. Remark. This proof shows that, given unit vectors ei and u, there 
exists a unique / S 0^(E) such that /(ei) = u. For then /(e 2 ) is determined 
up to sign, as it is contained in (/(ei))"*^; one choice gives / G O'^(E), and 
the other / G 0~(E). This remark about simple transitivity also follows 
from 1.4.4.1. 

8.3.3. Theorem. The set O' -{E) consists of reflections through lines of 
E. The group 0'*'(E) is abelian and acts simply transitively on the circle 
S(E) = {x€E\\\E\\ = 1}. 

Proof. The first assertion ensues from 8.2.13, simple transitivity in the second 
from 8.3.2. Commutativity follows from the calculation below: 


/a —b\/a' ~b' \ ( o,a' — ^l>' —[ab’ + a'b) N 
^6 ay y 6' a'/ \ab'+ a'b aa'— bb' J‘ 


□ 


8.3.5. Corollary. If E is oriented and we choose a positively oriented basis 
as in 8.S.1, b does not depend on the basis. For f G 0'^[E) and g G 0~ [E) 
we have fg = gf~^. Every f G 0'*'{E) is the product of two reflections 
through lines; one of the two lines can be chosen arbitrarily. 

Proof. Let B,B' be positive orthonormal bases and consider / G 0'^[E). The 
matrices Msif) and M^'if) are similar, since Mi'{f) — (Mg(t)) Mb(/) 

where t is defined by B' = t(8). For / G 0'^[E) and g G 0~(E), we 
have fg £ 0~'[E) by 8.2.3.2, hence fg is & reflection by 8.3.3. Reflections 
are involutive, so fgfg = Idg;; since g £ 0~(E) is also involutive, fg = gf~^ 
follows. Finally, if / G O'*'[E) and 9 is an arbitrary reflection in 0'~(E), one 
just writes / = (fg)g. D 


8.3.6. Definition and proposition. The cosine function cos : 0+(£) -» 
R is defined by f a, where a is as in 8.3.1. If E is oriented, the sine 
function sin : O'*'[E) —» R is defined by f >—*b, where b is as in 8.3.5. In this 
case the map 

0 : 0+(£7) 9 / i-> cos / + i sin / G C 
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is an isomorphism from O'*'(E) onto the group U of complex numbers with 
absolute value 1. The map 0 ts also a homeomorphism for the induced topolo¬ 
gies on O'^(E) c GL(.E) and U c C. 

Proof. 0 is a homomorphism by 8.3.4, and it is surjective since + 6^ = 1 
and clearly injective. It is obviously continuous and 0'^{E) b compact, so it 
is also a homeomorphism. □ 


8.3.7. Recap (see [FL, 181-183] or (CH2, 30], for example). The map 
A ; R 3 t I-+ e** € C, arising from the complex exponential map z e^ = 

Sn=o —T' ® homomorphism from the additive group R onto the multiplica- 

n! 


tive group U. The kernel A~^(l) is of the form ^Z, where ^ > 0; the numb^ 
^/2 is called tt, so A“*(1) = 2irZ. The circle U ts homeomorpkic to the quo¬ 
tient topological space R/27rZ. We also call cosine and sine the maps R —» R 
defined by cost = Re(e**) and sint = Im(e’*), respectively. 


0^[E) 



a 



Figure 8.3.7 


8.3.8. This implies, among other things, that U, and thus the circle S{E) 
for every Euclidean plane E, is path-connected, because R b. See also 8.12.4. 

8.3.9. Definition. Take f € 0'*^(E), where E is oriented. A measure of 
f is an arbitrary element 6 € A“ *(©(/)) (cf. 8.3.6). If 9 is a measure of f, 
every measure of f is of the form 6 -(- 2fc7r for A: 6 Z. 


8.3.10. K 0 is a measure of /, the matrix of / in any positively oriented 

orthonormal basis is f V in the notation of 8.3.7. 

\sm^ cos 6 } 


8.3.11. Notes. The restriction of the function cos to ]0, tt] b a bijection 
onto ( — 1,1]; its inverse b called Arccos. The restriction of sin to ]0, tt] is 
not injective, but the restriction to jO, 7r/2j b a bijection onto ]0,1], and its 
inverse b called Arcsin. 
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Figure 8.3.11 

8.3.12. Question and answer. One often identifies with C, but 
it would be nice, as suggested by 8.3.6, to identify an arbitrary oriented 
Euclidean plane E with C in an intrinsic way. This cannot be done, but it 
is possible to make E into a complex line using only the Euclidean structure 
of E and its orientation. In fact, let d € 0'^{E) be a rotation of E having 
measure x/2, and define multiplication by complex scalars on E by 

(A + i/j,)x = Ai -f ij,d[x). 

Since = — Idc and d € GL(£'), it follows that E is indeed a complex 
vector space with this multiplication and the preexisting vector addition. 

We will encounter d again in 8.7.3.5, this time without reference to mea¬ 
sures of rotations. Observe that the elements of [E) correspond to the 
linear maps z az for |ol = 1, and (real) homotheties correspond to maps 
z bz for h G R*. 

8.3.13. Notes. It may seem a waste to have to introduce complex exponen¬ 
tials in order to “measure” angles, but this measurement does in fact embed 
a fundamental difficulty, and it simply cannot be done without some cost. 
The reader can convince himself of this by consulting [FL, 178-186), [DE2, 
appendix I|, [BIl, V, §2 and VIII, §2). 

A much easier task is to show that every surjective continuous homo¬ 
morphism R —* U is of the form t >-♦ A(fcf), with A; G R* (see [FL, 184]). 
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8.4. Structure of elements of 0{E). Generators for 
0{E) and 0^{E) 

8.4.1. Proposition. Take f e 0(E). There exists an orthonormal basis of 
E in which the matrix of f has the form 

(h \ 

0 

0 

V Ar) 

where and Ig are the identity matrices of order p and q, and 

cos 6i — sinS.N 
sin 6i cos 0i ) ’ 

with fl,- € R \ ttZ for all i = 1,..., r. 

Proof. This result follows from 8.2.15 and 8.3.10. □ 

8.4.2. This decomposition is not unique, not only because the angles 6i are 
only determined mod 27rZ, but, more importantly, because if some of the 
angles of the Ai are equal, the basis is not unique (see example in 18.8). On 
the other hand, the structure of the pieces Ip and —Ig is weU determined, both 
the numbers p and q and the elements of the basis that correspond to them 
being determined by the proper subspaces Ker(Id£: —/) and Ker(Id£!+/) 
corresponding to the eigenvalues 1 and —1. In particular, p,q and r depend 
only on /. 

8.4.3. Corollary. The group 0{E} has two connected components, O'*'(E) 
and 0~{E), which are path-connected. If E is a finite-dimensional real vec¬ 
tor space, the group GL(£') has two connected components, GL'*'(£) and 
GL~(jE), which are path-connected. 

Proof. To show the latter assertion, endow E with an arbitrary Euclidean 
structure; by 8.1.4, it will be enough to show that 0'*'(E) is path-connected. 
Take / € 0'*'(E), and apply 8.4.1, using the same notations. The integer q 
is even if and only if det / = 1; a continuous path from / to Ids in O'*' (E) 
can be defined by the matrices 

B,{t) 0 ] 


Bg-[t) 

Ai(t) 

0 

V 


Mt)J 
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where t 6 [0,1] and 


cos vt 

— sin nt' 

sin rrt 

cos %t j 

cos $it 

— sin ffit 

smOit 

cos 6 it 


(» = 


□ 


One can prove that O'*' (f^) is path-connected using more elementary means, 
involving neither 8.4.1 or the measure of angles (whose definition, as we recall, 
resorted to the complex exponential, cf. 8.3.7). The problem is the same as 
finding a continuous path between two orthonormal bases 5, B' of E with 
same orientation. Take B = {ct}, B' = { <}; we will take into e\ by a 
continuous deformation, and this will prove the result by recurrence, if we 
consider the orthogonal complement e^. 

Thus, assume that ei ^ e\, and let P be the plane containing ei and e\. 
The circle S{P) is path-connected (cf. 8.3.8 and especially 8.12.4), and it is 
easy to extend a continuous path from ei to e\ into a continuous path from 
B and B", where the first vector of B" is e[. . 



8.4.4. We will now find exactly the smallest number of hyperplane reflections 
into which one can decompose an arbitrary / € 0(E) (cf. 8.2.12). We will 
also prove that codimension-two reflections generate 0'^[E) (just like hyper¬ 
plane reflections generate 0(E)), and will find the minimum number of such 
reflections in a decomposition of / € 0'^{E). 

8.4.5. Proposition. Let f e 0[E), and put 

s = dim E — dim(Ker(/ — Id£;)). 

Then f can be written as a product of s, but no less than s, hyperplane reflec¬ 
tions. If f is a product of s reflections through hyperplanes, the intersection 
of these hyperplanes is exactly Ker(/— Mb). 
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Proof. H / = aiii ■ ffi n ■ ■ ■ n C Ker(/ — Id^;), so k > s, 

and if A; = s this inclusion is actually an equality. Conversely, take / 6 0{E) 
and apply 8.4.1. The s hyperplane reflections that make up / are obtained as 
follows: for each basis vector ey 6 Ker(/ + Id^), take the reflection through 
the hyperplane e^; for each matrix A,- associated to a plane Pi, decompose 
the rotation of Pi into two reflections through lines of P,- (cf. 8.2.13), and 
extend these reflections into hyperplane reflections of E making them the 
identity on P^. □ 

8.4.6. Proposition. Let f e 0"^{E), with dimP > 3, and set 

s = dim E — dim(Ker(/ — Idc)). 

Then f is the product of s codimension-two reflections. 

Proof. This result is not used in this book, except in 8.5.3.1, and its proof 
is left as an exercise (cf. [FL, 193]), except in the case dim E = 3, when it 
follows immediately from 8.4.7.1 and 8.2.13. □ 

The proposition is false in dimension two, since then the only codimension-two 
reflection of P is — Id^, which is of course very far from generating 0'^[E). 


8.4.7. Examples. 

8. 4. 7.1. 0+(P) in dimension 3. By 8.4.1, every / e 0^(P) \ Id^ has a 

unique associated line D, called the axis of D; we say that / is a rotation 
with axis D. In fact, /Id € and is a plane. Such an / has ako 

an associated angle (cf. 8.6) € (0, jt]; if 0 = tt, / is the reflection through D. 

The practical calculation of $ is accomplished by remarking that, by 8.4.1, 
the trace of / is 1 -I 2cos^, and that the trace does not depend on the basis. 
Thus, in an arbitrary orthonormal basis, the matrix 

0 0 1 
10 0 
0 1 0 

is a rotation by an angle 2x/3 around the axis spanned by (1,1,1); observe 
that this rotation has order three. The uniqueness of the axis of / € O'^ [ E ) 
is used in an essential way in 1.8.3.1. 

8.4.7.2. 0 ~ ( E ) in dimension 3. By 8.4.1, we see that every map / G 
0 ~ ( E ) is the composition of a hyperplane reflection oh (here dim B = 2) 
and a rotation whose axis is the line D = 

8.4.7.3. The first example, other than 0"*'(2), where / G 0 [ E ) may not have 
any eigenvector occurs in dimension 4. In the notation of 8.4.1, 


M(/) = 


( cos Sx 
sin 6 i 
0 
0 


— sin 5 ] 0 ® A 

cos Si 0 ® 1 

0 cos 02 sin 02 I 

0 sin 02 cos 02 J 


and 01,02 ^ JrZ. 
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Two interesting cases, in some sense extreme ones, are the following: 
first, 81 = 62 , which is exactly the example already met in 1.2.9 and 4.3.6.2, 
and which will occupy a whole section (18.8). Second, Sj and 62 are “incom¬ 
mensurable”, in the classical Euclidean terminology; this means 81/82 ^ Q. 
Then the orbit of m € E \ 0 under the action of the group G = { /" ) 
n e Z } generated by / is an interesting object to study. Its closure is a 
differentiable submanifold of E homeomorphic to a torus (cf. 18.9). Figure 
8.4.7.3.2 represents something closely related to the orbit of /, the orbit of 


the group of rotations 

/costflj 

— sin t 8 i 

0 


M(/) = 

sint^i 

0 

costal 

0 

0 

cos 162 

0 

— sin 182 1 ’ 


^ 0 

0 

sin 1^2 

cosf02 / 


where t runs over R (and 81,82 are fixed, with 81/62 ^ Q)- Observe that the 
figure represents only the topological structure of the orbit and its closure, 
not the orbit itself in the four-dimensional space E (no wonder!). 



8.5. Simplicity of 0{E) 

This is a typical example of “geometric algebra”, the achievement of a 
purely algebraic result through geometric methods. We have met an example 
of this in 1.6.7.2 and the whole of chapter 13 is in this spirit; but these are 
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but momentary reversals of the general policy of this book, which is using 
algebra when necessary as a tool to solve geometric problems. 

We recall that a group G is called simple if it has no non-trivial normal 
subgroup (non-trivial means different from the identity and the whole of G). 

8.5.1. Theorem. The group O'*"(3) is simple. 

Proof. It suffices to show that if G C O'*'(3) is a normal subgroup different 
from the identity, G contains at least one codimension-two reflection. In fact, 
if g is the reflection through D and / 6 O'*'(3) is arbitrary, fgf~^ is the 
reflection through f(D) and still belongs to G. But O'*'(3) acts transitively 
on lines of R® (cf. 8.2.7), showing that G contains every codimension-two 
reflection and is thus equal to O'** (3) by 8.4.6. 



Now let / e G be arbitrary, / / Idg. By 8.4.7.1, / is a rotation with axis 
Ri, for X e R® \ 0, and angle d € jO, 7r[ (if 0 = tt, / is already a codimension- 
two reflection). Since /" € G, n € N, there exists n such that nd 6 l7r/2,7r[, 
so we may as well assume B € l’i'/2, xj. Now there exists a line D such that the 
two lines D, f{D) are orthogonal; to see this, take coordinate axes {x,y, z} 
in R®, where Rx is the axis of /, y is an arbitrary vector orthogonal to x 
and z = f(y). When D runs through all lines in the plane Rx + Ry between 
Ry and Rx, the angle between D and f{D) varies between 6 > 7 r /2 to 0 , 
and thus must be equal to x/2 somewhere. Choose g = cru as the reflection 
through D, and h = gfg~^ f~^ € G; the action of h on f{D) is given by 

f{D) 3 /~*(m) /“*(m) J-* m<^ —m € f{D), 

so that h\f^D) — — ldf^D), and by 8.4.7.1 h must be a reflection (through a 
line perpendicular to f(D)). □ 

8.5.2. Note. It may seem disappointing that we used properties of the real 
numbers, in particular the axiom for archimedean fields (to find n such that 
nB > ?r/2) to prove this result which is, after all, purely algebraic; but the 
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truth is that we could not have avoided doing so. There is an example of an or¬ 
thogonal group of a three-dimensional space with a positive definite quadratic 
form over a non-archimedean field which is not simple ([AN, 179 if.]). 

8.5.3. Theorem. The group 0'*'(n), modded out by its center (cf. 8.2.17), 
is simple for every n > 5. 

8.5.3.1. Proof. Let G be a simple subgroup of 0'^(n), and take / e 0'*‘(n)\ 

center. Our plan is to use / to construct g 6 G of the form g' Idv, with 
dimK = n —3 and g' € that is, g acts as the identity on V and as an 

isometry on K-*-. We have an orthogonal direct sum decomposition FF' = 
R”. By the proof of 8.5.1, we can find a codimension-two reflection of V-*-. 
This reflection can be extended by the identity on V into a codimension-two 
reflection of R"; since this extension lies in G, we have G = 0'*‘(n) by 8.4.6. 

8.5.3.2. There exists a plane P such that f[P) ^ P, otherwise every line 
of R" would be invariant under /, and / would be in the center of O"*' 

(cf. 8.2.16). Let S = P + f(P) be the subspace spanned by P and f{P)', since 
n > 5, we have S-^ ^ {0}. Consider the reflection h = Opi through P-*-, and 
write k = hfh~'-f~^. Then k = (Xp±aJ^p^^, and in particular fc| 5 J. = Id.iji, 
so fc 6 G \ center. Fix x € 5-'- \ 0 and y such that z = fc(y) / y. For every 
u e R" \ 0, denote by = cr„ j. the hyperplane reflection through . Then 
we have / = tyt^ € 0'*'(n), 

g = klk~^l~^ € ^ \ center. 


g — ktyt^k tj.ty — ktyk txtxly ~~ “ I'zl'y 

(because ktxk~^ = <^( 1 ) = tx, so ktx = txk). This g — t^ty is of the desired 
form, g = g' ® Idv', with dim = 3 (in fact here V-^ C S). □ 



8.5.3.3. What happens for G(4)? The answer will be given in 8.9.10. The 
basic reference on the simplicity cf the so-called classical groups is |DEl]; see 
also 13.6.8 and 13.7.14. 
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8.6. Angles between lines and half-lines 


In this section dim E >2. 

This section discusses non-oriented angles, as opposed as oriented angles be¬ 
tween lines or half-lines of a Euclidean plane, which are the topic of the next 
section. Non-oriented angles are dehned in a Euclidean space of arbitrary 
dimension > 2, possibly a plane. Lines here are vector lines; the affine case 
will be treated in the next chapter (cf. 9.2.1). 

8.6.1. Half-lines and oriented lines. Let E be an arbitrary real 
vector space. A ha.lf-line of E is a subset of E of the form x for x & E\0. 
The set of half-lines is denoted by D(E). The set of vector lines of E, alias the 
grassmannian Ge.i, will be denoted here by V[E). There is an obvious map 
p ; D[E) —* D{E), under which every point has two inverse images. Given 
A 6 P(E), its opposite half-line —A is the other inverse image of p(A), that 
is p(—A) = p(A) = Ri for any x 6 A. 




Recall that P{E) is simply the projective space P{E) (cf. 4.1.3.4). If E 
is a Euclidean space, P{E) also has a nice interpretation as the unit sphere 
S(E) of E, given by the bijection P{E) 9 x 6 Finally, P(E) is 


equivalent to the space of oriented vector lines of E (cf. 2.7.2). 


8.6.2. The definition below arises partly from the measurement of angles 
(cf. 8.3.7 and 8.3.11) and partly from 8.6.6: 


8.6.3. Definition. Let E be a Euclidean vector space and 
A,A'eP[E], D,D’eP{E). 


i) The scalar 




depends only on D and D', and not on the points 


X 6 D \ 0 a nd x' € D' \ 0. The (non-oriented) angle between D and D', 
denoted by DD' or D, D', is the real number £ [0,7r/2] defined as 



DD' = Arccos 
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for X e Z) \ 0 and x' 6 D' \ 0 (cf. S.3.11). 


ii) The scalar 


(x I x') 


depends only on A, A', and not on the points x € 


A \0, x' e A' \0. The (non-oriented) angle between A and A', denoted 
by A A' or A, A', is the real number £ [0, tt) defined by 


AA' = Arccos 


/ (x I x') \ 

Vl|x||||x'||/ 


8.6.4. In other terms, 


cos(DD') = 


|(j: I J^*)| 


cos(AA') = 


I ^') 


|fx|x')| 

By 8.3.11, we just have to verify that . 6 |—1, l), and this follows 

from 8.1.3. 


8.6.5. Notes. From 8.1.3 we see that DD' = 0 if and only d D = D', and 
DD' = 7r/2 if and only if D,D' are orthogonal. Analogously, AA' = 0 if and 
only if A = A', AA' = jr if and only if A = —A, and AA' = ir/2 if and only 
if A and A' are orthogonal. 

The value of angles is a hereditary property: U F C E is a subspace 
containing A, A', the angle A A' is the same, whether measured in F or in 
F, and ditto for P(E). 

Angles in P(E) or P{E) obviously satisfy two of the axioms for a metric; 
as we shall see in 9.9.8, chapter 18 and section 19.1, they satisfy the third 
as well. The metrics they define are, respectively, the intrinsic metric of the 
sphere S{E) and the elliptic metric on the projective space P[E). 

8.6.6. Proposition. Consider the obvious action of 0(E) on P~{E) and 
P^(E) given by g( , ) = (g( ), g( )). Then 

i) under the action of 0(E), two pair s (D,D ') and {Di,D[) belong to 
the same orbit if and only if DD' = DiD'^, and two p airs (A , A') and 
(Ai, A'l) belong to the same orbit if and only if A A' = Aj A'j/ 

ii) under the action of 0'''[E), both statements in (i) hold if dim E > 3, but 
for dim E = 2 they are both false. 

In other words, the orbits of 0(£^) in P^{E) (resp. P^[E)) are parametrized 
by a scalar € [0, tt] (resp. 6 [0,7r/2]), and for dim E > 3 the orbits of O"*" (£) 
are the same as those of 0(E). 

Proof. We choose to demonstrate the proposition for P(E)] the case P{E) is 
no more difficult. By 8.2.7, we can assume that A, A', Ai and Aj are all in 
the same plane of E, and also that A = A|. Pick an orthonormal basis {x, y} 
of this plane such that x C A — Ai, and vectors x' G A' and x'j G A' such 
that Ijx'll = ||x'i|| = 1. By definition and the assumption that AA' = AiA'^, 
the two vectors x', x'j have same x-coordinate; since their norm is 1, their 
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y-coordinates have same absolute value. If they are equal we have A' = Aj 
and we are done; if they are symmetric, the reflection (Ta satisfies 

<ta(A') = A',. 

This reflection of the plane can be extended into a codimension-two reflection 
of E ii E > 3, proving part (i). As for part (ii), figure 8.6.6 gives the desired 
counterexample. □ 

8.6.7. Question. FVom the above, a real number in [0, jr/2] classifies equiv¬ 
alence classes of pairs of lines under isometries; a natural question is to classify 
pairs of subspaces (F, W) of E, where V and W have fixed dimensions. 

When dim.E = 3, the classification by one parameter in [0,7r/2] works 
nicely. For dimK = dimW = 2, the parameter is the angle between the or¬ 
thogonal complements V-^, W-*-; for dimK = 1 and dimW = 2, one can take 
the infimum of VD for D C W, which is achieved when D is the orthogonal 
projection of V onto W. 

In higher dimensions, starting with dim E = i and dim K = dim W = 2, a 
single real parameter is no longer enough to classify the orbits of pairs (V^, W) 
under the action of 0[E). For a complete discussion, see |FL, 310-316]. 



Figure 8.6.7 
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8.7. Oriented angles in the plane 


In this section dim E = 2. 


8.7.1. It follows from 8.6.6 (ii) that angles (in [O, w\ or [0, Tr/2]) do not suffice 
to classify the orbits of D^(E) or D^{E) under O'^(E) in dimension 2; a finer 
invariant is required. This finer invariant, the oriented angle between lines, 
is well worth the trouble of defining it, since it will allow us to derive quick 
and elegant results in later chapters (see for example 10.9). 

It may appear that the machinery used to introduce something as simple as 
angles between lines is unnecessarily heavy, but there is simply no lighter or 
less cumbersome method, as the reader can verify by consulting [FL, 160-186) 
and comparing it with older texts like [I-R) and |D-Cl], where all relations 
between oriented angles are formulated “mod fc?r” or “mod 2fc?r”. The gain 
in clarity in our treatment is obvious. 

8.7.2. Oriented angles between oriented lines 

8.7.2.1. Since 0'*'(E) acts simply transitively on i^^{E) (cf. 8.3.3), we can 
consider the map $ : D^[E) —* 0+(£). This map gives an equivalence rela¬ 
tion on P^{E) (equivalent points have the same image). Another equivalence 
relation is obtained by considering the orbits (cf. 1.6) of P^{E) under O'^(E). 
We make the following agreeable remark: 

8.7.2.2. Lemma. The two equivalence relations above coincide. 

Proof. The first identifies two pairs (A, A') and (Ai, A]) if there exists g £ 
0+(£7) such that A' = g(A) and Aj = 9 (A]); the second, if there exists 
/ £ [E) such that A) = /(^) and Aj = The lemma foUows from 

the commutativity of 0'*'[E). □ 

Thus, calling ~ either of these relations, we can consider the quotient A(E) = 
D^{E)/ the canonical projection : P^(E) —* A{E) and the map $ : 
A(E) —» 0'*^(E) obtained from $ by passing to the quotient. Since $ is 
surjective, $ is by construction a bijection: 

P^(E) A{E) 

0^{E) 

8.7.2.3. Definition. We call A[E) the set of oriented angles between 
oriented lines of E. This set has a group structure deriving from that of 
0'*'(E); we denote the group operation additively. The oriented angle between 
A, A' £ P{E) is defined as ${(A, A')), and denoted by AA' or A, A'. 

We list a number of mostly trivial but useful properties of oriented angles: 
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8.7.2.4. Proposition. //A and A' are arbitrary oriented lines, 

i) A' = /(A) with f 6 O'^(E) if and only »/$(AA') = /; in particular, 

AA' = 0 <=>■ A = A'; 

ii) AA' = AjA'j^ if and only if there exists f € 0'*'{E) such that /(A) = Ai 
and f(A') = A',; 

iii) A A' = AiAj if and only if AAi = A'A\; 

iv) AA' + A'A" = A A" (Chasles’s relation); in particular, AA' = — A'A; 

v) /(A)/(A') = AA' for any f 6 0'*^{E), and f(A)f(A') = -AA' for any 
feO-{E). 

Proof. The only property that does not follow from the definitions and 8.7.2.2 
is the second statement in (v), and thb follows from 8.3.5. □ 

8.7.2.5. Remarks. We see that it is not necessary to orient E in order to 
define oriented angles; but in order to measure them, one does need to fix an 
orientation. 

Observe the relation between some of the properties above and those 
for affine spaces, for example, Chasles’ relation and the parallelogram rule 

(ab = cd <=> ac = bd). These relations hold for every simply transitive 
abelian group. 

8.7.3. Bisectors. We look for solutions of the equation 2x = a in the 
group A[E). We first solve the equation 2x = 0, or, equivalently, p — Id^ 
in 0'^[E). If M(f) = we know from 8.3.4 that = 1, so o = ±1 and 

6 = 0. This implies f = ±Id£;. (This can also be seen using our knowledge 
about involutions, cf. 8.2.9.) 

8.7.3.1. Proposition. The equation 2i = 0 has exactly two solutions, 0 

and Idc). We denote this solution by co, and call it the Bat angle. 

The condition AA' = ra is equivalent to A' — —A, where —A is the half-line 
opposite to A (cf. 8.6.1). □ 

This result 8.7.3.1 is also a consequence of what follows now. 

8.7.3.2. Definition. An oriented line S bisects the two oriented lines A 
and A' if AE = EA'. We say that E is a bisector of A and A'. 

By 8.7.2.4 (iv) and (v), an oriented line E bisects A and A' if the reflection 
CTi; through E (cf. 8.2.9) satisfies <ts(^) = But we know from 8.2.11 that 
there is a unique line supporting such aE, because an oriented line determines 
a unique positive unit vector. Thus A and A' have exactly two bisectors ±E, 
opposite to each other. Further, AE = EA' implies AA' = 2AE, whence the 
following result; 

8.7.3.3. Proposition. Two oriented lines A, A' have exactly two bisectors, 
supported by the same line and opposite to each other. For every a 6 A(E), 
the equation 2x = a has exactly two solutions, of the form {6,6+ w}. If 
A A' = a, the solutions x of 2x = a are related to the bisectors E of A, A' by 
X = AE. 
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A’ 



8.7.3.4. The equation 2x = ta is particularly interesting, because if AA' = x, 
where x satisfies 2x = a, the lines supporting A and A' are orthogonal, and 
conversely. One can see this algebraically or geometrically: the condition on 
the matrix (“ is that 0 = 0, whence b = ±1; and, geometrically, the 
bisectors of two lines A, A' are orthogonal to A because the line D which 
supports A must be invariant under cx; and distinct from D. Moreover, the 
solutions of 2x = C7 are connected with orientations as follows; 



Figure 8.7.3.4 



Figure 8.7.3.5 


8.7.3.5. Proposition. The solutions o/2x = g 7 are called the right angles of 
A(E). Choosing one of these angles is equivalent to orienting E, as follows: 
if d is the chosen angle, the orthonormal base {i, y} is positively oriented if 
and only if y = $(3)(x). □ 

Observe that d here is indeed the same as in 8.3.12. 

8.7.4. Measuring oriented angles. We assume in this paragraph that 
the Euclidean plane E is oriented, and apply 8.3.6 and 8.3.9; 
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8.7.4.1. Definition. The measure (or a measure) of an oriented angle 
a 6 A(E) is a measure of ^(a), i.e., an element of A~*(0($(a))); 



8.7.4.2. Examples. The real number x is a measure of rs, and ?r/2 is a 
measure of 9. If t is a measure of a, every measure of a is of the form t + 2A;Tr, 
where A; € Z. 

Using measures, one can show that the equation nx = a in A{E), where 
n is an integer > 1 and a £ A{E), has exactly n distinct solutions (cf. 8.12.7). 

8.7.5. Relationship between AA' and We now assume that E 

is oriented. Let A, A' be two oriented lines, seen here as half-lines (cf. 8.6.1). 
The angle AA' always has a measure in the interval 10,2x1. On the other 
hand, we can consider the basis {x, x'}, where x £ A and x' £ A' (excluding 
the case A' = ±A); the orientation of this basis does not depend on the 
choice of x and x'. 

8.7.5.1, Proposition. Let t be the measure of AA' that lies in |0,2x[. 
Bases defined by (A, A') are positively (resp. negativel y) ori ented if and only 
if t € [0,x( (resp. t £ [x, 2x[j, and then AA' = t (resp. A A' = 2‘K — t), except 
for the trivial cases A = A', t = 0 and A = —A', f = 2x — t = x. 



Proof. Leaving aside the trivial cases, we can assume A' ^ ±A. Changing the 
orientation switches the sign of A', so t becomes 2x —f; this reduces the proof 
to the case of positively oriented bases. More precisely, let x £ A, x' £ A' be 
unit vectors, and {ej = x, * 2 } be the positively oriented orthonormal basis 
with first vector x. The key point is that the coordinate of x' along the *2 
direction is positive; but that coordinate is sint, so t £ (0, x[, and 8.3.11 
completes the proof. □ 

8.7.5.2. Definition. Let A, A', A" be three half-lines of a (not necessarily 
oriented) Euclidean plane. We say that A' lies between A and A" if one 
of the following conditions obtains: either A = A' = A", or A = —A", or 
A' ^ ±A and A' belongs to the intersection of the half-plane defined by A 
and containing A" with the half-plane defined by A" and containing A. 
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Figure 8.7.5.3 

8.7.5.3. Corollary. If A' is between A and A", we have AA' + A'A" = 
A^. 

Proof. We only consider the cases when the three half-lines are distinct, the 
other being trivial. Choose the orientation of E in such a way that bases 
defined by (A, A') are positively oriented; the hypotheses show that the same 
is true of bases defined by (A', A") or (A, A") (excluding the case A" = A). 
Let s,t be the measures of AA' and A'A" that lie in |0, 27r[; by 8.7.5.1, 
s, t € jo, 7r[ and s = AA', t = A'A". But s + t is a measure of A A" and 
s -I- t e |0,27r[; if A" = —A we have 5 + 1 = ^, and if A" ^ —A bases 
defined by (A, A") are positively oriented, as remarked above, which proves 
the corollary by 8.7.5.1, since s + t = AA". □ 

8.7.5.4. Corollary 8.7.5.3 is very important. It k a very intuitive result when 
interpreted in the following physical way: define the sector determined by A 
and A' as the set of half-lines lying between A and A' if A / A', or either 
of the half-planes determined by the line supporting A, if A = A'. Then the 
non-oriented angle A A' is the measure of the sector determined by A and 
A', and if A' lies between A and A" the sector determined by A and A" is 
the union of the sectors determined by A, A' and A', A". 

Another approach starts from the arc of circle determined by two ori¬ 
ented lines, that is, the part of the unit circle S{E) of E that intersects the 



8.7. Oriented angles in the plane 


179 


corresponding sector. Then 8.7.5.3 says that the length of an arc of circle is 
the sum of the lengths of the two arcs into which it is divided. We shall come 
back (9.9.8) to this viewpoint, which originates in 8.6.5. More about sectors 
in [DE2, 65-67] and [FL, 183]. 

8.7.6. Notes. The preceding discussion illustrates the difficulty in measur¬ 
ing angles, stemming from the fact that the union of two sectors is not always 
a sector (figure 8.7.5.3), and, consequently, one cannot add sectors arbitrarily 
(see also [FL, 185-186]). 

There is an interesting construction for an order relation on A{E) \ 
w. look at A(E) as a circle (not intrinsically, cf. 8.3.6), and consider the 
stereographic projection of this circle onto R with pole w (cf. 18.1.4). This 
gives a bijection between A(E) \ to and R, and we can transfer to A(E) \ to 
the order relation on R. For more details, see ]FL, 176] or [DE2, 97-98]. One 
could also use the restriction of A to ] — ff, tt]. 



8.7.7. Oriented angles between lines 

8.7.7.1. Our object here is to extend the preceding discussion to lines and 
their set P^(E). This presents no difficulties, so we restrict ourselves to a 
quick presentation of the results, without any demonstrations. 

The starting point is either of the following two equivalent remarks: a 
line D determines exactly two opposite oriented lines (neither of them having 
precedence over the other, of course); and the group O'^ (E) does not act 
simply transitively on P^(E), but the quotient group 

PO+(E) = 0+(£')/{Id£,-Id£} 

does. The name PO'*'(£') stands for projective orthogonal group, because 
P(E) = P(E). We will discuss projective orthogonal groups in more general¬ 
ity in 14.7.2 and 19.1. 

8.7.7.2. Following 8.7.2.1, we define : P'^(E) —* PO'''(E) as the map 
canonically associated to the simply transitive action of the group PO’^(E) = 
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O"^ (£7)/{± Hb} on P^(E). The equivalence relation ~ is the same as the 
one given by the action of PO'*'(£). The quotient P^[E)/ ~ is denoted by 
A(E), the canonical projection by p : P^(E) —» A{E), and the induced map 
A{E) —» PO‘''(£') by 3!^. Thb map is a bijection: 

P^E) A(E) 

* \ y ^ 

O-^(E) 

Here q : P^(E) —* P^(E) comes from the homonymous map q : P(E) — * P(E) 
which takes an oriented line into the line supporting it. 

We call A[E) the group of oriented angles between lines of E, its abelian 
group structure being lifted from PO'*’(£r) via 2- The oriented angle between 
the lines D,D' 6 P{E) is the element p((£>, D')) £ A{E), and is denoted by 
DD' or D^'. 

8.7.7.3. We have the following relations; 

= d7d\ <=> = lfD\i = 0 <=►£) = P'; 

f(D)T(D') = ±DD' for / 6 O* (P). 

8.7.7.4. The equation 2a: = 0 in A(E) has two solutions, one being 0. The 
second, denoted by S and called the right angle, is the image under q of the 
two right angles in A{E). The equation 2x = a in A(E) has always two 
solutions, of the form {x,x + a}; if o = DD', the two lines S such that 
X = DS, for 2x = DD', are^orthogonal, and called the bisectors of D, D'. 
They are characterized by DS = SD', or (rs(D) = D'. 


S 




8.7.7.5. Let D, D', S, S' be lines such that the cross ratio [P, P', 5, 5'] in 
the projective line P(E*) is equal to —i. Saying that P J. P' is equivalent 
to saying that P and P' are the bisectors of S and S'. 

8.7.7.6. In order to measure oriented angles between lines, we have to give 
an orientation to E. Define maps © and A so the diagram below (cf. 8.3.6, 
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8.3.7, 8.7.4) commutes. By definition, a measure of a e A{E) is an arbitrary 
element of A~*(0(3!.(a))). All measures of an element of A(E) are of the 
form t + itk, for € Z; 0 is a measure of 0, and ?r/2 is a measure of 6 . 

A(E)^0+(E)-^ U 

|p |ldR 

A(£:)-^PO+ (£;)-^U/{±l}<^R 

If £ is oriented and D, D' € D{E) are distinct lines, DD' has a unique measure 
t in the interval )0,7r[, and we either have DD' = t or DD' = ?r — f (cf. 8.6.6). 
If D and D' are not orthogonal, we can choose a unique orientation for E so 
that t e ]0, Tr/2|. Observe that there is no analogue for corollary 8.7.5.3 here; 
for example, if D, D', D" are as in figure 8.7.7.6, we have DD' = D'D" = 
DD" = 7r/3, and no matter what in order we consider the lines, we can never 
have ^W' + D'D" = DD"\ 




Figure 8.7.7.6 


Figure 8.7.7.7 


8.7.7.7. The proof of 4.3.6 hints that there is a natural bijection between 

A{E) and A(E). In fact, consider the map A(E) 3 i i—> 2r 6 A{E), with 
kernel { 0 , 117 }. By 8.7.3.3, the map A{E) —► A(E) induced on the quotient is a 
bijection which, by abuse of notation, is still written A[E) 3 x 2x € A[E). 
Its inverse is written A(E) 3 x >—• € A(E). 

Figure 8.7.7.7 gives a geometric interpretation for these maps, explaining 
why in trigonometry cosines, sines and tangents can be expressed as functions 
of the tangent of the half-arc. 

8.7.7.8. Proposition. Let D, D' be lines in E. Then 

CTD' °<7d = ${'2DD'), 

that is, ao' °crD ** o rotation by the angle 2DD'. 

Proof. One knows (8.2. 3. 2) that / = ao- o (Tu 6 0'*(E)-, we have f{D) = 
od'(D) = D" and DD" = 2DD' by 8.7.7.4. Since 0'^[E) is simply transitive, 
the proposition is proved. □ 


182 


Chapter 8. Euclidean vector spaces 



8.7.7.9. The maps between O'^ [E) and PO'*'(^) corresponding to the dou- 
bling and halving of angles will be denoted by and respectively. 

8.7.8. Trigonometry. The relation = e*e* implies the following 
formulas; 

cos(a + fc) = cosacosfc — sin a sin b, 

8.7.8.1 . ! . 

sin(a + fc) = sin a cos b -I- sin bcos a, 

which hold for all a, 6 € R. From 8.3.6 it follows that for E oriented the same 
formulas hold in O'^(E), with the product fg of maps f,g& 0'^[E) instead 
of the sum a + b. 

8.7.8.2. FVom the definitions of the group structures of A(E) and A{E), it 
follows that, always assuming E to be oriented, formulas 8.7.8.1 hold without 
change in A(E) and ^(£^), where the cosine and sine of an oriented angle are 
defined as the cosine and sine of any of its measures. 

8.7.8.3. One deduces from 8.7.8.1, through various algebraic concoctions 
and decoctions, a number of other formulas, some of which are very useful 
and nice; for a sizable list, see 8.12.8. For applications of such formulzis, see 
10.3.10 and 18.6.13. 

8.7.8.4. Conversely, oriented angles can be used in the geometric study of 
complex numbers. Consider C as an oriented Euclidean plane, that is, 
with its canonical Euclidean structure and orientation. For a £ C \ {0}, 
we can write z — |a|(;^| 2 |), where zl\z\ € U. The oriented angle (0 o 
S)~^(a/|a|) £ .^(C) = .^(R^) is caUed the argument of the complex number 
z, and is denoted by arg(a). If a = 0 we put arg(a) = any element of A{C). 
The number a £ C is determined by |a| and arg(a), and the pair (|a|, arg(a)) 
is called the polar form of a. One has 

8.7.8.5 |aa'| = |a|| 2 '|, arg(aa') = arg( 2 ) + arg(a'). 

In calculations it is ofter easier to use measures, that is, to write a = {a{e'^, 
where t is a measure of arg(a). This expression is also called a polar form of 
a, and is sometimes written in the form a = | 2 |(cost + isint). In 9.6.5.1 we 
will find some classical applications of this formula; see also 12.4.2. 
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8.8.1. We return to arbitrary-dimensional Euclidean spaces E. A similarity 
of £ is a map that preserves not lengths, but ratios between lengths. The 
interest of this notion lies not only in the nice applications it gives rise to 
(9.6.7, 9.6.8, 9.6.9), but also in its connection with our physical universe 
which, not having a privileged unit of length, is Euclidean only up to an 
arbitrary scalar constant. 

8.8.2. Definition. A similarity of E, of ratio n, is a map f € GL(.E) 
satisfying ||/{x)|| = /^HxH for all x ^ E. The set of similarities is denoted by 
GO(£'); orientation-preserving fresp. reversingJ similarities are the elements 
o/GO+(£') = GO(£') nGL+(£) {resp. GO-{£') = GO(i:) n GL~(f^)). 

8.8.3. A homothety Ald^ is a similarity, of ratio p = |A|. If A < 0, AId£; 
preserves orientation if and only if dim£ is even. If / € GO(E) has ratio 
p, We have (/(x) 1 /(v)) = p^(x \ y) for all x, y 6 E-, to see this, write 
/ as / = (p~^f) o (/ilde), and apply 8.1.5 to p~^f 6 GL(.E). This also 
shows that GO(£) is isomorphic to the direct product of groups GO(.E) = 
0{E) y. (R; Ids); similarly, GO+(£:) “ 0+(£) x (R^ Id£;). For n = 1, we 
evidently have GO(£) = GL(£); this trivial case will often be excluded in 
what follows. 

8.8.4. Plane similarities and complex numbers. Let E be an 

oriented Euclidean plane, which we identify with a complex vector line as 
in 8.3.12. The elements of the complex linear group GLc(.E) of E sat¬ 
isfy definition 8.8.2, so GLc(£^) C GO(.E). In fact, one has the equality 
GO'*'(£) = GLc(£^)i shown by 8.8.3 and 8.3.12. To recover GO~[E), fix 
an arbitrary s € 0~(E) and write GO~(£^) = s GO'^(E). Since s is reflection 
through some line, we can take a basb of E lying on this line (a complex basis 
of E has a single element!) and deduce the following 

8.8.4.1. Proposition. In an arbitrary complex basis of E, an orientation- 
preserving (resp. reversing) similarity of E is a map z >—* az (resp. z i—. azj, 
where a is any element of C*. 

8.8.5. Similarities and angles 

8.8.5.1. Proposition. Similarities preserve angles. More precisely, for 
A, A' € V{E) and D, D' 6 P(E), we have: 

i) if f e GO(£;), then f(A)f(A') = AA' and f(D)f(D') = 

ii) if dim E = 2 and f & GO^^ [E) {resp. GO~(.E)), then /(A^/IA') = 

(resp. -AA'; and f(D)f[D') = (resp. -DD'); 

iii) if f ^ GO(£), the conditions D ± D' and f{D) -L f(D') are equivalent. 

Conversely, if dim E > 2 and f 6 GL(.E) is such that f(D) and f{D') are 
orthogonal whenever D,D' £ P(E) are, f is a similarity. 
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Proof. The only non-trivial part is the converse, since parts (i), (ii) and (iii) 
follow from 8 . 6 . 6 , 8 .7.2.4 and 8 .7.7.3. ff / preserves orthogonal lines, we have 
(/(®) I — 0 for all X, x' such that (x | x') = 0. Start by fixing x S P\0 

and consider the linear form 4> : y (/(x) | /(y)); this form vanishes on the 
hyperplane x"*- by assumption, so there exists by 2.4. 8.6 a scalar k(x) € R* 
such that 

(/(a:) I /(y)) = Hx){x I y) 

for all y € .E. To show that fc(x) does not depend on x, consider k{x + x'), 
where x, x' are linearly independent; we have 

fc(x + x')(x + x' I y) = (/(x + x') I /(y)) = (/(x) | /(y)) + (/(x') | /(y)) 

= fc(x)(xly) + A:(x')(x'|y) 

for every y & E, whence (A:(x + x') — fc(x))x + (fc(x + x') - k(x'))x' = 0. By 
linear independence, this implies k[x) = k[x') = fc(x + x'), and it follows that 
A; is a constant and / is a similarity. □ 

8.8.6. Similarities and complexifications. Let E'' be the com- 
plexification of E, € Gh^E^) the complexification of f € GL(£), and 
N'' , a quadratic form on E '', the complexification of the quadratic form 
N :x-* ||x||2 on E (cf. 7.1.1, 7.2.1, 7.3.4). 

8 . 8 .6.1. Definition. The subset (N^)~^(0) of E'' is called the isotropic 
cone of E if dimE > 3. If dim E = 2, this set consists of two distinct lines 
of E'-^, called the isotropic lines of E; this pair of lines is denoted by {/, J}. 
The conjugation a on E'^ (cf. 7.1.1) satisfies a((7V‘^)~*( 0 )) = (.Ar''’)~*( 0 ). 
For dim£ = 2, conjugation permutes I and J. 

The word cone is justified because, from the homogeneity of TV, we have 
N'-'[Xx) = A^TV''(X). The assertion about (TV'^)"*( 0 ) consisting of two 
distinct lines follows from the non-degeneracy of TV, because then N'' is also 
non-degenerate (cf. 7.3.5, 13.2.1, 13.2.3.1). We give a direct proof, that does 
not depend on 13.2: let { 61 , 62 } be an orthonormal basis for E] then 

TV'^'(zi,Z 2 ) = 2 ? + «! 

(cf. 7.3.5), and -f ^| = 0 is equivalent to zi = ±iz 2 - The other assertions 
are immediate: 

o{(TV'^)-M 0 )) = (TV‘-)-‘(<^( 0 )) = (TV'")-‘( 0 ) 

by 7.2.1, and (t{I) — J, ct(J) = / by taking, 1 = {(z, -tz)} and J — {(z,iz)}, 
for example. We say “for example” because there is no distinction between 
the two isotropic lines of a Euclidean plane; more precisely: 

8 . 8 . 6 .2. Proposition. Picking out one of the isotropic lines of the Euclidean 
plane E is equivalent to orienting E. The correspondence is given by choosing 
the orientation of E with respect to which the chosen line 1 has slope —i in 
an orthonormal fcosis. 
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Proof. We must show that GO'*^(.E) leaves I fixed. Intuitively, this comes 
from the fact that GO'*'(£7) is connected, so I cannot suddenly turn into J. 
Calculating in an arbitrary positively oriented orthonormal basis and putting 

^if) = (b 

8.8.6.3. Note. This calculation shows also that the eigen value o f I (resp. J) 

under / € O'*'(E) is equal to a + bi = ©(/) (resp. a — bi = ©(/)), where 0 is 
as in 8.3.6. Also, by 8.3.1 (ii), we have = J for / € 0~(Ej. 

8.8.6.4. Proposition. Let f € GL(£). Then 

i) / e GO(£) if and only if f'''’{(N'’)~^{0)) = (JV^’)~'(0), that is, if and 
only if f^ leaves the isotropic cone of E globally invariant; 

ii) if dim E = 2, 

f e GO+(£) <=> /"’(/) = 7, / € GO-(£;) f‘^(I) = J. 

Proof. The implications =>• are trivial. Assume that leaves invariant 
the nullspace of TV'"’; then the inverse image (f'''')*(N'^') of under f' 
(cf. 13.1.3.9) is a quadratic form with same nullspace as . By 14.1.6.2, 
this implies that there exists fc € C* such that [f''Y(N‘^') = kN*^'; in particu¬ 
lar, restricting to £ C E*'', we get ||/(a:)|p = fc||x|p for all z e .E. This shows 
that k is real and / is a similarity of ratio y/k, proving (i). For (ii), observe 
first that /({/, J}) = {/, J} by 8.8.6.1; thus we can apply (i) and observe 
that if / € GO'^(£7) (resp. / 6 GO~{E)) we have /(/) = 7 (resp. f[I) = J) 
by 8.8.6.2 and 8.8.6.3. □ 

8.8.7. LaGUERRE’S formula. This result is the fruit of the excogitations 
of Laguerre, who, not satisfied with his college teacher’s geometry course, 
took to himself to clarify it. 

8.8.7.1. Assume first that E is an oriented Euclidean plane and (7, J) are 
its isotropic lines in the order specified in 8.8.6.2. Let D and D' be lines in 
E, and D'' and D"‘ their complexifications in E'' (cf. 7.4). 

In the complex projective line P(E*’) = P{E‘^'), it is natural to consider 
the cross-ratio of the four points 77*", T?'*"', 7, J. Let / € 0'*'[E) be such that 
f{D) = D'\ by 6.6.3 and 8.8.6.3, we have 

177*‘,77"",7,J] = ©(/)/^. 

But since ©(/) € U, we have ©(/) = (©(/)) thus an application of 8.7.5.1 
and 8.7.7.7 gives an expression of (T?*"’, 7?""’, 7, J] in terms of oriented angles 
only: 

8.8.7.2. Theorem (Laguerre’s formula). We have 

|Z7*-.7?'*",7,J1= [0(a(^'))]". 

and, if t is an arbitrary measure of 7777', we have [77*", 77'*" , 7, J] = □ 
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For conscience’s sake, we verify that the formula behaves as it should when 
the orientation of £ is switched. In fact, I and J are permuted by 8.8.6.4, 
©(■) becomes (©(•)) by 8.3.6, and correct behavior is assured by 6.3.1. As 
for switching D and D', apply 8.7.7.3 and 6.3.1. For applications, see 17.4.2.2. 

8.8.7.3. Assume now that E is not oriented; the lines I and J are indistin¬ 
guishable. For two lines D,D' in E, the cross-ratio \D‘' ,D"', 1,J] can still 
be defined up to inversion in C. Define the bijection log : U\{ —1} —» )—tt, 7r| 
as the inverse of the restriction of A to ] — ir, ir[. Let f be a measure of DD' 
for an arbitrary orientation of E-, by 8.7.7.6, DD' is equal to t or tt — t, and 
in either case we have 

8.8.7.4 m^=^\\og{[D‘\D"\J,J])\, 

where the absolute value serves exactly to neutralize the ambiguity between 
log(e^*‘) and log(e^‘l’’“**) = log(e“^**) = — log(e^‘*). 
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Figure 8.8,7.3 


8.8.7.5. Formula 8.8.7.4, due to Cayley, is a good illustration of his prin¬ 
ciple (cf. 5.2.2); it displays the geometry of angles between lines of E, or, 
equivalently, the geometry of the metric of the projective space P(E), as a 
subgeometry of an appropriately chosen projective space (cf. 19.2). 


8.9. Quaternions and orthogonal groups 

Complex numbers played a fundamental role in the study of 0"''(2); 
similarly, the skew field H of quaternions will allow a fine study of 0"*'(3), 
and especially O''" (4). 

8.9.1. Review of quaternions. The quaternions form a skew field, 
denoted by H, and constructed by giving a multiplication defined in the 
following fashion; 

lu = u (u = = ii = = jj — = kk = —V, 

ij = —ji = k\ jk = —kj = i; ki = —ik = j. 

Here {l,t, j,k} denotes the canonical basis of R^, and the multiplication is 
extended to the whole of R^ by linearity. This operation is associative, but 
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not commutative. The subspace R - 1 is identified with R, and the subspace 

Rt + Rj + Rfc c H 

is identified with R^ and called the set of pure quaternions. Every quaternion 
9 e H can be written q — .S(q) + P{q), where .E(g) G R and P{q) S R®. By 
definition, the conjugate g of q is the quaternion JS(q) — P(q). One has 

f= 9 . q = q*=>q&B., g =-9 <=> g 6 R®, 

q + r = q + r, ^ = rq (watch out!), 
g e R® 9 ^ e R_, 9 e R •<=»■ g^ G R+. 

The canonical scalar product in R^ is recovered through the formula 

(7 k) = j(?»-+r 9 ); 

in particular, 99 £ R.^., and || 9 {| = is called the norm of 9 . One has 
IlgrII = Ikilklh shows that H is indeed a field, for we can take q~^ = 
g/llgll"^. We define 

5® = {9llkll = i}, 

the unit sphere of R'* = H, and embed in the space of pure quaternions 
as 

5^ = {9GR®ltl9||=l} 

We see that 5® has a multiplicative group structure, just like 5^ = U = { z | 
|z| = 1} is also a group (but 5® is not abelian). In this context it is natural 
to ask whether higher-dimensional spheres can have a group structure, good 
in a specified sense; the answer, obtained with the aid of algebraic topology, 
is no: only 5* and 5® have good group structures (see references in |PO, 284) 
or |HU, chapter 15j). The sphere has an interesting structure, it’s almost 
a group; this is obtained using Cayley numbers (cf. [PO, 278]). Nor should 
we expect to find reasonable field structures on R" for n other than 2 or 4; 
the only field structures are really C on R^ and H on R^ (cf. [PO, 284]), 
although R® again has a reasonable structure, the Cayley numbers. For R”, 
see 8.10.3 and the references contained there. 

Observe that if 9 , 9 ' are pure quaternions, the pure part P(qq') of 99 ' is 
exactly the vector product 9 X 9 ' in R® (see 8.11.13). 

Contrary to the case of C, the automorphisms of H are simple and well- 
known; see 8.12.10 and compare 2.6.4. 

8.9.2. Theorem. 

i) LetsGR®\ 0 . Then q t—> —sqs ^ leaves R® invariant, and its restriction 
to R® is the reflection through the plane o/R®. 

ii) Let s £ H* and p'„ '■ q >-* sqs~^. Then p’ leaves R® invariant, and its 

restriction p„ = ]r3 belongs to 0"*'(3). Moreover, p, — p,i if and only 

if there exists A G R* such that s' = As. 

iii) Conversely, for every f £ O'*" (3) there exists s £ H* such that f = p,. 
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Proof. To see that R® is invariant, use the criterion q 6 R® <=> € R_: 

(—S9s~*)^ sq^a~^ £ R_. 

The map q >-♦ —sqs~^ is certainly linear and length-preserving: 

for all q. Finally, s >-♦ —= —s and if g £ R® is orthogonal to s we have 
0= {q\s)= ^( 9 ^ + 39 ) = ^(- 9 S - sq), 
so that sq = —qs and 

q I—* —sqs~^ = ss~^q = q. 

To study p', and p,, one proceeds as above and concludes that p, £ 0(3); 
there remains to see that p, £ O'*’(3). This is clear topologically, since H* 
is connected and the map taking s £ H* into the map q >-♦ sqs~^ in 0(3) is 
continuous, so p„ is in the same connected component of 0(3) as IdR. 3 , which 
is the image of s = 1 under this map. The “moreover” clause follows because 
the only elements s £ H that commute with all elements of R® are obviously 
the real numbers. Finally, (iii) follows from (i) and 8.2.12. □ 

8.9.3. Corollary. The map p : 5® 9 s I—♦ P„ e O'*'(3) is a continuous 
surjective group homomorphism. Its kernel is {±1}. In particular, 0'*'(3) is 
isomorphic to P®(R). 

Proof. Recall that 5® = { 9 £ H ) || 9 || = 9 }. The kernel is the set of s £ R 
such that ||s|| = 1 , by 8.9.2 (ii). The quotient of 5® by the subgroup {± 1 }, 
or, equivalently, by the equivalence relation z ~ y <=> y = ± 1 , is exactly 
P®(R) (cf. 4.3.3.2). □ 

8.9.4, Proposition. Let s = a + t with f £ R® \ 0 and a £ R. The axis 
of the rotation p, £ 0"*’(3) is the line Rf, and the angle 6 of p,, 9 & [0, tt), is 
given by tan^/2 = ||t||/|a| if a ^ 0, and 9 = if a — 0, in which case p, is 
the reflection through R<. 

Proof. It is clear that R< is invariant. The calculation of 9 is more delicate; 
observe first that p,,,--* = PjP.,(Pj)’"^ and p, have the same angle, and since 
O’*’(3) acts transitively on S^, there exists z with zsz~^ = /Si, for £ R. 
This z can be calculated for an an arbitrary s = a + ^i. 

The angle of p, will be read in i"*- D R®, that is, 9 = j, p,j. But we have 

py = (a -I- I3i)j(a + ^i)~' = (a^ 4 4 Pi)j(oL - jSi) 

= {a^ + 0T^((a^-0^)3 + 2apk). 

Thus 

tan 9 

and tan S /2 = /?/a = ||<| 1 /|q! 

Finally, a = 0 implies p,(y) = —j. □ 


2a;g _ 2[P/a) 

q,2 _ ^2 J _ (^/a)2 ’ 
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8.9.5. The importance of 8.9.3 and 8.9.4 is that it gives a parametric rep¬ 
resentation for 0"''(3), the group of rotations of our physical space; such 
parametrizations are fundamental in mechanics and physics, but their are 
not easy to find or deal with. Consider, for example, Euler angles (jSY, 286- 
288]), and try to write the formula for the composition of two rotations! The 
results in 8.9.3 and 8.9.4, on the other hand, allow us to deal easily and neatly 
with composition. There is just one drawback one would like to get rid of: 
the ambiguity caused by the non-trivial kernel {±1}. This, however, cannot 
be avoided, in view of the following result: 

8.9.6. Proposition. No right inverse f : 0'^(3) —» 5® for p is either 
continuous or a group homomorphism. 




Proof. Let r be the circle consisting of rotations around the axis Ri. By 8.9.4, 
we have p~*(r) = T' = 5(R + Rt), the unit circle in the plane R + Rt C H. 
The proof of 8.9.4 also shows that the restriction p|r' : P' —* T is exactly 
the map z z^, under the identification R + Ri = C. Suppose there exists 
/ : 0'^(3) —♦ with p° f = Id,)+( 3 |, and look at the restriction /|r. If 
/ is a homomorphism, we must have /(Id) = 1, then /( — I) = ±i because 
p(/(l)) = (/( —l)) = —1 by the above. Choosing /(—I) = », for example, 

we get 

(/(- I ))'= /((-!)")=/( 1)-1 = ‘" = - 1 . 

a contradiction. 

Now suppose that / is continuous and /(Id) = 1 (the case —1 is treated 
analogously). By continuity, f(z) = y/z, the usual determination of the 
square root, on C \ R_; but then, when z approaches —1 within the upper 
(resp. lower) half-plane the square root f(z) approaches i (resp —i), contra¬ 
dicting continuity. 
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8.9.7. As in the proof of 8.9.2, we see that, for s, r € H* satisfying ||s||||r|| = 1 
(and, in particular, if ||s|| = ||r[| = 1), the map H 9 g sqr 6 H is an 
isometry of H, and, in fact, belongs to O'*'(4), as in 8.9.2 (ii). We actually 
have more: 

8.9.8. Theorem. The map 

T : X B (^)»') ^ {9 ^ * 9 *^} £ ^^( 4 ) 

is a continuous, surjective group homomorphism. Its kernel consists of two 
points {1,1) ond(—1, — 1). 

8.9.9. Proof. The set 5® X 5^ is obviously being considered with its product 

group structure. We have r(ss', rr')(9) = ss'qrr' = s{s'qr')f = (T(s,r) o 
T{s',r')){q). The kernel is formed of pairs (s, r) such that sqr = q for every 
q; we must have r = so = q for every q, and s 6 R, so s = ±1. 

To show surjectivity, take / € O'*" (4) and qo = /(!)• Then T{q~^, l) € O'*" (4) 
and T(gQ’,l)(go) = Ij so r(g(J'^,l)o/ e 0'''(3). But by 8.9.2 (iii) we have 
’■(90 ° / - P', for s e H*, so / = r(go> 1) °p' = »'(9n«>s“‘)- 

8.9.10. Corollary. The group 0'*'(4) has normal subgroups other than 

itself and its center {± Idf^t}—for example, r({l} x 5^) and r(5® x {1})- In 
particular, O'*" (4)/{± Id|i4 } is not simple. □ 

8.9.11. Remarks. This concludes our investigation of the simplicity of 
0(n), started in 8.5. 

We see that O'*" (4) is almost a direct product. The real direct product 
is the orthogonal projective group PO'''(4): 

8.9.12 PO+(4) = p(r(5=’ x {l})) x p(r({l} X 5")). 

Equation 8.9.12 has nice geometric consequences, cf. 18.8.8. For other appli¬ 
cations of quaternions to algebra, see |VL], a very interesting book. 

8.9.13. Question. The complex numbers C have afforded a detailed study 
of O’*" (2), and the quaternions a study of O'*" (3) and O'*" (4). Thus we have 
worked our way up to dimension four. How does one parametrize 0'*'(n)? 
The answer is given by Clifford algebras, cf. 8.10.3. 

8.10. Algebraic topology and orthogonal groups 


8.10.1. This section closes the plan announced in 8.2.14. It assumes some 
familiarity, at least, with algebraic topology. We mention ere passant that 
each O'*' (n) is a smooth manifold and a Lie group; in fact, it is exactly using 
Lie group techniques that one answers the problems in 8.10.4, among others. 

8.10.2. Particular cases. We have seen that 0'''(2) is the circle 5*, 
O'*" (3) is the real projective space F®(R), and O'*'(4) can be interpreted 
as (5** X S^)I7i2 by 8.9.8. We mention that O'*'(5) is related to ([POi 
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corollary 13.60]), and 0‘'‘(6) is related to ([POj prop. 13.61]). See also 
|DEl, 106-116]. Finally, 0"*'(8) gives rise to a “triality” .phenomenon, due 
to the fact that the quotient between outer automorphisms Aut(0'''(8)) and 
inner automorphisms Int(0"*'(8)) is isomorphic to the symmetric group S 3 
(cf. |CE2, 119]): 

Aut(0+(8))/Int(0+{8)) “ Ss- 

8.10.3. Fundamental group. Since 0'*'(2) is homeomorphic to the circle 
5*, the fundamental group (2)) is isomorphic to Z. On the other hand, 

7ri(0+ (n)) = Z2 for every n > 3. For n = 3 this follows from 8.9.3, since p is 
a twofold cover and is simply connected; the case n > 3 follows from n = 3 
and the exact homotopy sequence of the fiber bundle obtained by writing the 
sphere 5" as the homogeneous space S" = 0+(n + l)/0"''(n) (cf. 1.5.9 or 
8.2.8). It is interesting from the geometric point of view to find the non-sero 
element of Tri(0'''(3)) (and hence of jri(0'’’(n))). The answer is afforded by 
the proof of 8.9.6, which shows that the circle F, consisting of rotations of 
R® around a fixed axis, is a non-trivial loop in O'*'(3), since 5® is simply 
connected. 

Observe also that the loop F, described twice, becomes null-homotopic, 
since jri(0'*'(3)) = Z^. This can be demonstrated either with a paper band 
(figure 8.10.3.1) or by the trick of the soup bowl (figure 8.10.3.2). The per¬ 
former moves his arm so as to rotate a soup bowl by 27r around its axis, 
always holding it right side up on his hand. His arm is then twisted. The 
surprising thing is that by repeating the same operation he now untwists his 
arm, instead of twisting it by another 2jr! See ]PO, 2]. 
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But the fact that 7ri(0+(n)) is non-trivial has a much more important 
consequence. Indeed, one can lift the group structure of G = to its 

universal cover G, a simply connected topologic space. We obtain an abstract 
group 0+(n), and one can ask whether this group has a geometric realization. 
For n = 3, we have realized 0+ (3) as a subset of the quaternions H, the group 
operation being multiplication in H. What is the analogue of H for n > 3? 
The answer is Clifford algebras, and G+(n) is called the group of spinors. See 
[PO, chap. 13], [BI2, §9], and the references in 13.7.14.4. 

It is galling to realize that spinors themselves did not come out of these 
lucubrations, but were only discovered by Elie Cartan in 1913, when he clas¬ 
sified primitive irreducible linear representations of the Lie algebra of 0'*'[n). 
One of these representations, hitherto unknown, had the unexpected dimen¬ 
sion 2”; it has since been known as the spinor representation. Clifford alge¬ 
bras, on the other hand, were discovered in 1876, while Clifford was trying to 
extend theorems like 8.9.2 and 8.9.8 to higher dimensions. 

8.10.4. Other algebraic topological structures associated with O'^ {n) (ho¬ 
mology groups, cohomology ring, etc.) are all completely known. See, for 
example, [BOl], and also [HU, 92-95]. 
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8.11. Canonical volume form, mixed product and 
cross product 

Recall that if n = dimi^ we have dimA”^^* = 1 (cf. 2.7.2.1). 

8.11.1. Lemma. If E is a Euclidean space, K”E* has a canonical Euclidean 
structure (here, a norm), defined by 

= lw(e,,...,e„)| 

for every ui 6 and every orthonormal basis (e^) of E. 

Proof. Since uj 6 h”'E* is determined by its value on a basis of E, it is 
enough to show that , e„) j = j(eii - ■ • > for two arbitrary or¬ 

thonormal bases. Defining / € GL(£) by /(ei) = e[ for t = 1,..., n, we have 
w(ej,..., e^) = (det f)ui{ei,..., e„), and det / = ±1 by 8.2.1. □ 

8.11.2. Note. More generally, the exterior, tensor and symmetric algebras 
A''E, A’‘E*, &’E, &'E‘, QfE, (y'E* have natural Euclidean structures for 
every p, inherited from the structure of E (cf. |BI2, 115]). In this book we 
only consider the case A"E*. 

8.11.3. Definition. Let E be an oriented Euclidean vector space. The 
canonical volume form on E is the element Xe G A"E* characterized by 
II'^kII = 1 (cf. 8.11.1) and by assigning a positive value to any positively 
oriented basis of E (cf. 2.7.2.2). For n = 3 we also call Xe the mixed product, 
and sometimes write Xe( , -, ) = ( > i )• 

Let E be a Euclidean vector space. The canonical density on E is the 
map Se '■ E” —* R+ defined by (ei,...,e„) —» |A„(ei,..., e„)| where Xe is 
the canonical volume form on E for an arbitrary orientation. 

In other words, Xe and 6 e are characterized by 

8.11.4. ^^(ei,..., e„) = 1 (resp. Xsici, .. ■ ,e„) = 1) for any orthonormal 
basis (resp. positively oriented orthonormal basis) (e,). 

8.11.5. In order to calculate Se(xi,. .. ,x„), we introduce Gram determi¬ 
nants. If £■ is a Euclidean space and (ij)i=i.is a subset of E with p 

elements, the Gram determinant of (xi) is the scalar 

(xi I X2)' • (xi I X,, 

\\x 2 f ■■■ (X2|xp 

(x,, |X2) ||x,,f 

8.11.6. Proposition. For every finite subset (xi)i=i.„ of E one has 

(5e(xi, . • ■, x„) = (Gram(xi,...,x„))‘'^^. 


Gram(xi,..., x,,) = det((x< | x,)) = 


11^1 r 

(X2 |xj) 

(x,, |xi) 
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Proof. Let (ej) be an orthonormal basis, and choose / e GL{E) such that 
/(e,) = X, for all t = 1,..., n. Let A = M(f) be the matrix of / with respect 
to the basis (e,). By Lemma 8.8.2, 

Gram(xi,..., x„) = det(*j4i4) = (det AY = (det /)^, 


whence 

5£;(xi,...,x„) = |det/|($£;(ei,...,e„) = |det/l. □ 

8.11.7. Examples. Take x,y & E and apply 8.11.6 to the Euclidean space 
V = Rx + Ry. We have 6 v (2C,J/) = (Gram(x, y)) ^ , so Gram(x, y) > 0; but 


Gram{x, y) = 


ll^f 

(y 1^:) 


(^|y) 

llyf 


l|xf||yr-(^|y)^ 


proving the Schwarz inequality (cf. 8.1.3). Moreover, by 8.6.3 we see that 
6 v (x, y) = sin(Rx, Ry)||x||||y||, where the angle between lines is non-oriented. 

In the case of three vectors, the reader can try to prove directly that the 
Gram determinant Gram(x, y, x) is always positive. 


The rest of this section is devoted to cross products, an operation used 
all the time in everything three-dimensional that has to do with our physical 
universe. 

FVom now on E is oriented. 


8.11.8. Proposition. Let E have dimension, n > 3. Given n — 1 vectors 
(x,),=i.„_i of E, there exists a unique vector xi x • • x Xn-i of E satisfying 

(xj X • • • X Xfi —1 I y) = (®i) • • •) 1) y) 

for every y E. This vector is called the cross product of the x,-. The cross 
product possesses the following properties: 

i) the map 9 (xj,..., x„_i) ij x • ■ • X x„_ j € E is a multilinear 

alternating form; 

ii) Xj X ■ ■ ■ X Xn_i = 0 if and only if the Xi are linearly dependent; 

iii) xi X X x„_i e (Rxi -1 + Rx„_i)-^; 

iv) if the vectors (xi) are linearly independent, the set {xi,..., Xn-i, xi x 

• X x„_i} is a positively oriented basis for E; 

v) ||ii X X x„_i|| = (Gram(x],...,x„_i))‘'^^ 

~ I^Ril H-h 1 (®11 ■ • • ) Xn— I ); 

vi) ||ii X ■ • X is uniquely characterized by (iii), (iv) and (v) together. 



Figure 8.11.8 
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Proof. Existence and uniqueness follow from 8.1.8.1. Part (i) comes from the 
fact that Xe is multilinear and alternating. One direction of (ii) is trivial; to 
prove the converse, assume that the z, are linearly independent, and adjoin 
to them another vector x„ to form a basis. We have 

(xj X - X Xn~\ I Xfi) = A^(xi , .. ., Xn —I, Xn) ^ 0, 

so the cross product cannot be zero. For (iv), apply 2.7.2.2 to the equation 

Ab(xi,... ,x„_i,xi X ■ ■■ X x„_,) = ||xi X ■ X x„_i|p > 0. 

For (v), put 0 = xi X ■ ■ • X x„_i, and observe that (z \ Xj) = 0 for every 
i = 1,..., n — 1, and that, by the definition of Gram determinants, 

Gram(xi,...,x„_i, 0 ) = Gram(xi,, x„_i)|| 0 ||^, 

so that 

(Ab(xi,...,x„_i,z))^ = llzil^ = Gram{xi,...,x„_i, 0 ) 

= Gram{xi,...,x„_i)|| 0 (|^ 

(cf. 8.11.6). Finally, to show that (iii), (iv) and (v) characterize the cross 
product, one only has to consider the case when the Xj are linearly indepen¬ 
dent (otherwise the product is zero); but then dim(Rxj + ■ ■ ■ + Rx„_i) = 1, 
and (iv) and (v) do indeed determine a well-defined vector on this line. □ 

8.11.9. Remarks. For n = 2 , the “cross product" would simply be the map 
E B xy-* d(x) € E, a. rotation by a right angle (cf. 8.3.12 and 8.7.3.5). 

The cross product affords an easy method to complete an orthonormal 

set {ei},=i.n_i into a positive orthoiiormal basis. The practical calculation 

comes in 8.11.11, 8.11.12. 

8.11.10. The cross product in coordinates. Let (e^) be an orthonor¬ 
mal basis for E, and let A = (xj,... ,x„_i) = (x,y) (i = 1, ...,n — l;j = 
l,...,n) be the matrix whose columns are the vectors x,- in the basis (e,). 
Then 

8.11.11 i-th coordinate of Xj x • x x„_i = (—1)"~* det j4,, 

where A, denotes the square matrix obtained from A by deleting the t-th 
line. To see this, put z = Xj x ■ • x x„_j = (zi, • • ■, z,,), and expand the 
determinant giving (z | y) with respect to its last column: 

*11 Zn-i.i yi 

(z I {/) = AB(xi,...,x„_i,y) = : : : 

^In — Vn 

= ^(-l)"“'y, det A, = 


This completes the proof, since equality holds for any yi. 


□ 
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H n = 3, for example, we find 



8.11.13. Quaternions. Consider R® with its canonical Euclidean struc¬ 
ture and orientation. The cross product affords an easy way to calculate 
the product of two pure quaternions x, y £ R® C H (and, by extension, the 
product of any two quaternions). By 8.9.1 and 8.11.12, xy = —(x | y) -|-1 x y, 
with (x I y) 6 R C H. 


8.12. Exercises 


* 8 . 12 . 1 . Let £ be a finite-dimensional real vector space, (j) and ip two Euclid¬ 
ean structures on E, and G C GL(£) a subgroup of the linear group of 
we assume G is irreducible (cf. 8.2.6). Show that if G C 0{E, <f>) D 0{E, ip) 
(i.e., if every element of G leaves <p and ip invariant, cf. 13.5), then ip and ip 
are proportional. 

* 8.12.2. A GEOMETRIC PROOF FOR 8.2.15. To prove that / € 0{E) 
always leaves some line or plane invariant, consider some x e S{E) such that 
II/(x) — x|| is minimal, and show that x, f[x) and /^(x) lie in the same plane 
(cf. 9.3.2; use 18.4). 

8.12.3. A CALCULUS PROOF FOR 8.2.15. The data being the same as for 
8.12.2, prove that the plane determined by x and f(x) is invariant under / 
by showing that the derivative with respect to i of the map 

5(£)9y--||/(y)-y|| eR 

is zero (cf. 13.5.7.2). 

8.12.4. Prove directly that a circle is path-connected. 

8.12.5. Normal ENDOMORPHISMS. An endomorphism / of a Euclidean 
space E is called normal if it commutes with its adjoint: */ ° / = f °*f 
(cf. 8.1.8.6). Show that 8.2.15 holds for any normal endomorphism. (See also 
13.5.7.) What are the normal endomorphisms when dim£' = 2? 

8.12.6. Determine all continuous homomorphisms R —» GO(£), where 
dim £■ = 2. Draw the associated orbits (cf. 9.6.9). Use 8.3.13. 

* 8.12.7. Show that, for every n € N* and every a € A(E), the equation 
ni = a has exactly n solutions in A(E). Draw the solutions on a circle for a 
few values of o, with n = 2, 3,4, 5. 
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8.12.8. Table of trigonometric formulas. Prove the formulas below, 

SIQ X 

where n G N and a, 6, c G R are arbitrary. As usual, we define tan x =-. 


(”) tan a - Q) tan^ a + • • • + (-1)'’ (ap+i) tan^P+i a + 

l-(;)tan’a+... + (-l)»(ytan»'a+... 

o+fc a — b 
cos a + cos b — 2 cos —r— cos —r— 

/L 

a+b . a-b 
cos a — cos 6 = —2 sin —r— sin —r— 


(state similar formulas for the sum and difference of sines); 

I. + *•) 


tan a tan b = 


cos a cos b 


Show that the maximum value of a cos t + 0 cost, for < G R, is In 

the next four formulas, assume a, b, c positive and o + 6 + c = jr: 

... a b c 

sin a + sin 0 + sin c = 4 cos - cos - cos — 

I ^ . a . b . c 

cos a + cos 0 + cos c = 4 sin - sin - sin - 

At jL /• 

tan a + tan b + tan c = tan a tan b tan c 

b , c c a a b , 

tan - tan - + tan - tan — + tan - tan - = 1 

2 2 2 2 2 2 

VlO+^. cos^ = |yCb^/ 27 ^^ 

sin(n+l)f no 

1 + cos 0 + • • • + cos no = -i—r —- cos — 

sinf 2 
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lim cos — cos — • • • cos — =-. 
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* 8.12.9. Cross products in R®. For every a, 6, c e R®, prove the following 
formulas: 

a X (fc X c) = (a I c)b — (a [ fc)c, 

(axb.ax c,b X c) = (a,fc,c)^ 

(a X i) X (a X c) = (o, b, c)a. 

Show that R^, endowed with the operations of addition and vector product, 
is an anticommutative algebra which, instead of being associative, satisfies 
the Jacobi identity 

a X (b X c) + b X {c X a) + c X (a X b) = 0. 

Such an algebra is called a Lie algebra. 

If p, 9 , r denote the projections from R^ onto the three coordinate planes, 
show that ||a X = Gram(p(a},p(6)) + Gram( 9 (a), g(b)) + Gram(r(a), r(6)). 
Find a geometrical interpretation for this result (see figure 8.12.9). Generalize 
this result for Gram(ai,..., a,,) in a space of dimension n, where n and p are 
arbitrary. 



Study the equation x x a = b (for a and b given); find whether there is 
a solution and whether it is unique. 

* 8.12.10. We call a bisector of two half-lines A, B in a Euclidean vector space 
the unique half-line that is a bisector of A and B in the plane generated by 
A and B, and lies between the two half-lines (cf. 8.7.5.2 and 8.7.3.3). Let 
S, T, V be three half-lines in a 3-dimensional Euclidean vector space. Find 
three half-lines A,B,C such that 5 is a bisector of A, B, T is a bisector 
of B, C, and U is n bisector of C, A. Study possible generalizations of this 
problem: replacing half-lines by lines, considering more than three lines, or 
considering higher-dimensional spaces. 

* 8.12.11. Automorphisms of H. Show that every automorphism of H is 
of the form a —» .S(a) -I- p(P(a)), where p € C>'''(3). 
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8.12.12. Compute some explicit compositions of rotations. In particular, 
calculate the composition of the rotations of order three and five of the 
icosahedron given in coordinates in 12.5.5.3. 

8.12.13. Show that every finite-dimensional linear representation of a com¬ 
pact group is seniisimpie. This means that, given a compact group G and 
a homomorphism f : G GL(F) from G into the linear group GL(y) of a 
finite-dimensional real vector space (a representation), any vector subspace 
W C V such that f(g){V) = V for all 9 € G admits a direct sum complement 
W such that f(g){W) = W for all y € G. 

8.12.14. Let {ei}j=].„ orthonormal basis for an n-dimensional Eu¬ 

clidean vector space. Determine explicitly the reduced form (cf. 8.4.1) of the 
element / of 0 (n) defined by /(cj) = for every i = 1 ,..., n — 1 and 
/(e„) = ei- 




Chapter 9 

Euclidean affine spaces 


Euclidean affine spaces, in two and three dimensions, are the 
objects of classical geometry and correspond to our physical uni¬ 
verse. Their structure is rich and has been studied for thousands 
of years, starting with the Greeks; as a consequence, there is a 
wealth of results and we’ve had to choose from among them. 
The choice is consistent with the rest of the book: we start with 
a quick discussion of the foundations, then offer results that are 
simple and nice, but at the same time not trivial to prove; and 
finally we give an overview of related questions, more difficult 
and perhaps still under investigation. 

The first three sections include, aside from basic definitions 
and results, formulas for the explicit calculation of distances (sec- 
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tion 9.2), in which Gram determinants play an essential role 
(they will be encountered again in section 9.7). Section 9.4 ap¬ 
plies the results from section 9.3, on the structure of plane isome¬ 
tries, to the problem of polygons of minimal perimeter inscribed 
in a convex polygon, which is relevant to the study of trajectories 
of light rays or billiards, among others. This problem is in some 
sense opposite to the question of ergodicity of billiards. 

Section 9.5 studies similarities and gives non-trivial charac¬ 
terizations for them, in particular Liouville’s theorem. Section 
9.6 uses plane similarities in the solution of several problems: 
similar divisions, the double pedal curve of two circles, and log¬ 
arithmic spirals. 

The last seven sections are quite varied and are partially 
meant to prepare the material for the remainder of the book, 
but they also introduce fundamental notions. Section 9.7 stud¬ 
ies the relationship among distances between several points, and 
mentions the problem of giving a purely metric characterization 
of affine spaces, in which the Cayley-Menger determinant play 
an essential role. Section 9.8 studies subgroups leaving invariant 
a fixed subset, developing the connection between compactness 
of the subgroup and the existence of a common fixed point. Sec¬ 
tion 9.9 discusses the length of a curve and the shortest path be¬ 
tween two points. In section 9.10 we use differential calculus to 
derive the distance as a function of its endpoints (first variation 
formula), and we deduce some applications. In 9.11 we extend 
the metric to the so-called Hausdorff distance, defined on the set 
of compact subsets of the space; this will be used in chapter 12. 
Section 9.12 defines the canonical measure of a Euclidean affine 
space, and the derived notion of the volume of a compact set 
(which is really our everyday notion of volume); we meet them 
again in chapter 12. Finally, section 9.13 discusses Steiner sym- 
metrization, which has played an important historical role and 
is still used in the proof of the isoperimetric inequality (chap¬ 
ter 12); in this chapter we use Steiner symmetrization to prove 
Bierberbach’s isodiametric inequality. 

In addition to the exercises given in 9.14, we recommend that 
the reader leaf through the numerous ones given in [HD], [R-Cj, 
[FL], [PE], [CRlj. Some of these have a great graphic interest. 
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Unless we state otherwise, X stands for a Euclidean affine space of finite 
dimension n. 


9.1. Definitions. Isometries and rigid motions 

9.1.1. Definition. A Euclidean affine speu:e is an affine space [X,X), 

where X is a Euclidean vector space. An affine frame .n of X is 

called orthonormal if {xo3:i}i=i.n « cm orthonormal basis for X. We endow 

X with the metric given by d(x,y) = xy = ||xy||. 

z 

y 

Figure 9.1.1 



9.1.1.1. The strict triangle inequality. As can be immediately seen from 
8.1.3, the equality xz = xy+yz holds if and only if z e |x, y], the ixrmsegment 
joining x and y (cf. 3.4.3). 

9.1.2. Remarks. The topology generated by this metric is the canonical 
topology of X (2.7.1.1). An (affine) subspace S C X inherits from X a natural 
Euclidean affine space structure. 

The standard n-dimensional Euclidean affine space is X = R", where 
X = R” is the standard n-dimensional Euclidean vector space (8.1.2.2). An 
isometry between two Euclidean affine spaces X, E is a bijection f •. X —* Y 
that preserves distances: d(/(x),/(y)) = d{x,y) for all x, y £ X. Every n- 
dimensional Euclidean affine space is isometric to the standard space R". To 
see this, take an orthonormal frame {x,} for X and define / : X —» R" by 

/ f ^0 "b ^ ) ~ (^1) ' * • I ^n). 

This justifies the expression “n-dimensional Euclidean space”, but we shall 
not use it too often. 

The set of isometries of X is denoted by Is(X) (cf. 0.3). IVanslations 
and, more generally, any / € GA(X) such that f £ 0(X), are isometries: we 

have d{f{x),f{y)) = ||/(x)/(y)|| = ||/(.xy)|| = |lxy|| = d(x,y), by 2.3.2 and 
8.1.5. All isometries are of this type; 

9.1.8. Proposition. A map f : X —* X lies in ls(X) if and only if 
f £ GA(X) and f £ 0(X). 
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Proof. Pick an arbitrary a € Jf, and let t = (-, be the translation by 

/(“)“ 

the vector f(a)a. Then g = t o f 6 Isa(.X^), because T(X) C Is(X), where 
T(X) is the set of translations of X. It suffices to show that g e O(Jfa). 
Calculating in Xa, and applying the definition of an isometry atnd 8.1.2.4, we 
get (g(a:) |3(y)) = (a: |y) for every x,y & X. This implies g € 0(Xa) by 8.1.5. 

□ 

9.1.4. Proposition and definition. Put ls*(X) = { / e Is(X) | / e 
0*(X) }. The elements o/Is'''(A') are called (proper) motions (and those of 
Is~(Jf) are sometimes called “improper motions”). For every a S: X, we have 
the semidirect products 

h(X) = T{X)h4X), ls+(X)^T(X)hUX), 

and the set-theoretical product 

Is-(X) = T{X) X Is„-{X). 

Moreover, the correspondence f f induces group isomorphisms Is„(X) = 
0(X) and Is^(A') S O'^(X). In the topology induced by the topology of 
GA(X) (cf. 2.7.1.4), the space Is(A') has exactly two connected components 
Is'*'(Jf) andIs~(Jf), which are path-connected. □ 

It is the path-connectedness of Is'*' (X) which justifies the term “motion” for 
/ e Is‘*’(JV), because there exists a continuous homotopy F : |0, l) —♦ Is'*’(A') 
joining F(0) = Idx and F(l) — f. It k this F which, properly speaking, is a 
motion of X. This is more intuitive if we apply P to a fixed subset C of A" 
and draw the images C(t) = P{t)(C). 



9.1.5. We can thus write X as a homogeneous space X = Is(Jf)/Is,i(Jf), 
where a k given. Thk is often more interesting than the other homogeneous 
space decompositions X = X,, and X = GA(Jf)/GA„(X), because in the 
first case, the definition of an affine space, the stabilizer is trivial, and in the 
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second the stabilizer GAa(J(f) is too big, whereas Isa(X) is compact. But 
Is(X) is big enough in the following sense: 

9.1.6. Proposition. Is(X) acts simply transitively on ortkonormal frames 
of X. Its action is 2-transitive (cf. 1.4.5.) in the following sense: for any 
four points a,b, a', b' & X such that a'h' = ab, there exists f € Is(Jf) such that 
f(a) = a' and f{b) = b'. If, moreover, dimX = 2, there is exactly one such 
f in Is+(X). 

Proof. This follows from 8.2.7. □ 

9.1.7. Note. We have singled out the fact that Is(X) is 2-transitive because 
this apparently unremarkable property is in fact extremely strong. Under cer¬ 
tain additional regularity assumptions, metric spaces whose isometry group is 
2-transitive can be classified. Besides affine spaces, we shall encounter three 
other examples in the book: spheres (chapter 18), real projective spaces and 
hyperbolic spaces (chapter 19). The other possibilities are the complex and 
quaternionic projective spaces, the octonionic projective plane, and their non¬ 
compact counterparts. See (BU2, 95], a recent reference, and also |B-K, 117] 
and 1PV2]. 

9.2. Orthogonal subspaces. Distances between 
subspaces 

9.2.1. Metadefinition. The notions of orthogonality, angle and oriented 
angle between (affine) subspaces and oriented lines of X are extended from 
the vector case by considering the directions in X of the relevant objects. The 
same notations are used as in the vector case. 

For example, if D,D' are two lines of X, their angle (in |0,7r/2|) will be 

denoted by £)£>' = DD', and orthogonality between two lines D and D' is 
indicated by £> ± B'. 

9.2.2. Proposition. Let S be a subspace of X and X & S a point. There 
exists a unique subspace T containing x and such that X = S®-^T. The point 
S D T (cf. 2.4.9.4) is characterized by xy = inf {xz \ z & S }. This number, 
denoted by d[x,S), is called the distance from x to S (cf. 0.3). The distance 
xz is a strictly increasing function of the distance yz. 

Proof. Since ( xy | yz ) = 0, we have xz^ = xy^ -f yz"^ by 8.1.2.4, and the 
function t i—► yl-f is strictly increasing. □ 

9.2.3. Remark. We have observed en passant that if x,y,z are distinct 
points of X the condition xy^ -^yz^ — xz^ is equivalent to (i, y) ± (y, z). This 
is the so-called Pythagorean theorem, which gives rise, in axiomatizations of 
Euclidean geometry, to some very involved demonstrations. 
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Figure 9.2.3 


9.2.4. Projections and reflections. Fix a subspace S of X. Propo¬ 
sition 9.2.2 yields a map jt*: ; 9 11 -» y 6 5, called the orthogonal projec¬ 

tion onto S. The reSection through S is the map Of; : X X defined by 

cr.(;(x) = X -f 2xir<f(x) (or by the condition that X 5 (x) is the midpoint of x 
and <T^s'(i))- We have o-.s- € conversely, every involutive isometry / is 

a reflection through an appropriate subspace, as can be seen by noting that 
/ leaves the point (x -f /(x))/2 invariant, for arbitrary x and applying 8.2.9. 
If 5 is a hyperplane, as is called a hyperphne rejection. 



9.2.5. Distance between subspaces. Let 5 and T be subspaces. We 
define the distance between S and T (cf. 0.3) as 

d{S, r) = inf { s< I s € 5, t 6 T }. 

The infimum is always achieved for some s € 5 and t €: T; two such points 
are characterized by the condition st € (5)'^ n(T)-*- (if 5nr = 0). The pair 
(s,f) is unique if and only if 5 fl r = {o}. 

The reader can supply an algebraic proof of the existence of the pair 
(s, f), or use a compactness argument. As for the condition st 6 (S') 
it is necessary by 9.2.2. Conversely, if it is satisfied, consider the subspace 
S' parallel to S and containing t (cf. 2.4.9.2). Take x € S, y € T, and 
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y' = jrs->(a:). We have [xy' | y'y) = 0, so xy > xy' = st. Uniqueness follows 
from the criterion st e n (T)-*-, for if st = s't' = d(S, T), we must have 
ss' = tt' & S nf. 

9.2.6. Computations. We have collected below some formulas that allow 
the solution of a good number of computational problems, including three- 
dimensional ones. We have not included formulas involving orthonormal bases 
and coordinates, as they follow from the ones given below and 8 . 1.2. 6 , 8.11.5.8 
and 8.11.11. We assume X to be oriented so as to be able to use all the results 
from 8.11. 

9.2.6.1. Let 5 be a subspace of X, x G: X a. point and . k a (not 

necessarily orthonormal) affine frame of S. Then 

9.2.6.2 

Gram(a:os(,... ,xna:fc) 




Figure 9.2.6.2 

Proof. Put y = TTsfr). By 8.11.6, Gram determinants do not change when 
we add a multiple of one column to another, so we have 

Gram(ii72, io^i,..., xo^) = Gram(xt/, xqXi, .. ■, xoXfc). 
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But the right-hand side is equal to ||xy|| Grain(xoXi,..., xoXfc), <is observed 
in the proof of 8.11.8 (v). □ 

9.2.6.3. Let H = /~*(0) be a hyperplane of X, where / € A(X;R) 
(cf. 2.7.3.l). H X € X is a point, we have 

9.2.6.4 d(x,R) = J^^, 

ll/ll 

where || • || denotes the canonical norm in (X)‘ (cf. 8.1.8.2). 

Proof. Put y = na{x). We have f{y) = 0, so /(x) - f(y) = f(x) = f(^). 
But i/(yx)| = |(/' 1^)1 = i|/'|l||^ll = ll/ll ly, since yx € so the vectors 
/• and ^ are proportional. We then apply 8.1.8.1, 8.1.8.2 and 8.1.3. □ 

9.2.6.5. Let D, D' be lines of X and a,b (resp. a', b') points in D (resp. Zf). 
Then 

(P(D D') = 

Gram(a6, o^) 

This Ls proved just like 9.2.6.1, by introducing s,t such that d(D,D'). The 
line {s,t), which is in general unique, is called the common perpendicular to 
D and D'. 


9.3. 


Structure of isometries. Generators of Is(.Y) and 
Is+{X) 


A good number of questions that can be asked about / e Is(X), con¬ 
cerning fixed points, globally invariant subspaces, and so on, are answered by 
the following structure theorem: 

9.3.1. Theorem. Take f € ls(X). There exists a unique decomposition 
f — ^1° 9 satisfying the following conditions: g is an isometry of X, the set 

G of fixed points of g is non-empty, and the translation vector ^ lies in G. 
Under these conditions we also have G — Ker(/ — Id^) and t^o g = g o t^. 

Proof. We fist observe that X = Ker(s — Id^) 0"^ Im(s — Id^) for every 
s 6 0(X). In fact, the dimensions add up, and orthogonality follows from 
the definitions: 

(u I s(t;) — w) = (u I s(u)) — (« I v) = (s(u) | s(w)) — (u | w) = 0. 


Take an arbitrary a € X and write the vector o/(o) as a sum ^ -f A, with 
h = /(f) — t and /(|) = Put x = a — t. By construction, 

x/(i) = xo-l- af{a) 4- /(o)/(x) = f-i- f/(f) - t- /(f) = /. 

By putting g = f I * o/, we do indeed have y(x) = x. The remaining assertions 
are trivial. □ 
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Figure 9.3.1 


For more details and the proof of Corollary 9.3.3, see |FL, 194-197). 

9.8.2. Remark. It was the existence of a line globally invariant under 
/ (and necessarily having direction Ker(/ — Id^^)) that allowed us to find 
the decomposition 9.3.1. Such a line can be found by a purely metric and 
topological reasoning, and is in fact given by (x,/(x)), where x e X is such 
that xf(x) = inf { y/(!/) | y € X). Proof: let y be the midpoint of x and 
/(x). Since / is an isometry (and a fortiori an affine morphism), /(y) is the 
midpoint of /(x) and f^(x) and we have, by assumption, 

d(y, /(y)) > d(x, /(x)) = d(/(x), /^(i)) = d(x, /(x)). 

But 

d(y, /(y)) < d(y, f(x)) + d(f(x), /(y)) = 2 x ^d(x, /(x)). 

Thus equality obtains everywhere, as this can only happen (cf. 9.1.1.1, the 
strict triangle inequality) if /(x) 6 (y,/(y)). In other words, the three points 
f{^)i f^(x) Are collinear. 

Conversely, every x 6 G in 9.3.1 satisfies x/(x) = inf { y/(y) | y 6 X }, 
as can be seen by applying 9.2.3 to the triangle {o, a — h, /(o)}. 

9.3.3. Corollary. 5c< a = dimX—dim(Ker(/—Idj^)). 

i) If Ker(/ — Id^j) = {0}, the map f has a unique fixed point, called its 
center. 

ii) Every f S Is(X) is the product of at most n + 1 hyperplane reflections. 
At least s are necessary if f has a fixed point, and at least s + 2 if f has 
no fixed point. 
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iii) Every f 6 Is'^(X) is the product of s eodimension-two reflections. Every 
f 6 T(X) is the product of 2 codimension-two reflections. □ 

The following classifications also derive from 9.3.1; 

9.3.4. Structure of plane isometries. If n = 2, every / e Is'^(X) \ 
T(X) has a unique fixed point a, called its center. We say that / is a rotation 
with center a, and / is characterized by its center and its angle, defined as 
$''^(/) 6 A(X). Every / 6 Is“(Ar) possesses a unique globally invariant line 
D and can be written in a unique way as f = t^oau, where ^ S D. The line 
D is called the reflection axis of D. 



JCp 


6 - 

ojx) 


Figure 9.3.4 


D 




-o 

f(x) 


9.3.5. Structure of three-dimensional isometries. Let n = 3. A 
rotation is any element / of Is''’(A^) \Idx having a fixed point. By 8.4.7.1, a 
rotation leaves point wise inv ariant a unique line D, called its axis; the angle 
of / is the constant A/(A) € [0, jt], where A is an oriented line orthogonal 
to D (figure 9.3.5.1). The axis and the angle, however, are not sufficient to 
determine / wholly; there are still two possibilities. See |DE2, 132-135], or 

9.14.5, for more details. 



A map / 6 Is"^ (X)\T(X) is called a screw motion; a screw motion leaves 
globally invariant a unique line D, called its axis, and can be written in a 
unique way as t^o g, where g is a rotation with axis D and ^ € D (figure 
9.3.5.2). 

There are two kinds of elements in Is~(A’). The first kind (figure 9.3.5.3) 
leaves at least one point invariant, and can be written as r o ajj , where H is a 
uniquely determined hyperplane and r b a rotation with axis perpendicular 
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to H (or r = Idx)- The second kind has no fixed point and can be written in 
a unique way as t^ocTfj, where H is a. hyperplane and ^ € H (figure 9.3.5.4). 

9.3.6. Examples in dimension 2. Let n = 2, and take 0 , 5 , o', 6' such that 
ab — a'b'. By 9.1.6, we know there exbts a unique / 6 Is'*'(X) (and a unique 
/ 6 Is''(X)) such that /(a) = o' and /(&) = It is natural to try to find an 
explicit geometric construction for this /. This is not hard. 

If / 6 Is'^(X) (and excepting the obvious case / 6 T(X)), we find 
the center u) of f (cf. 9.3.4) by taking the intersection of the bisectors of 
the segments [a, o'] and [6,6']. Another construction, also good for the case 
of orientation-preserving similarities (cf. 9.7.5), uses the fact that w is the 
intersection of the circles defined by a, a', {a, &)n(o', b’) and b, b’, {a, 6)n(o', b’). 
(See figure 9.3.6.1.) 

If / 6 Is” (X), it is enough to remark that the axis of / (cf. 9.3.4) contains 
the midpoint of x and f(x) for every x € X (figure 9.3.6.2). 

It is also useful to be able to construct compositions explicitly (figure 
9.3.6.3). Ill 1.7.5.3 we made essential use of the following result: if r and a 
are rotations with center o and b, respectively, and tsr = Idx, the center c of 



9.3. Structure of isometries. Generators of Is(A') and Is'^(A') 


211 





Figure 9.3.6.2 


t and the angles of r, s,t in A(X) satisfy the equalities 


2 ac,ab = angle of r, 26a, 6c = angle of s, 2 cb,ca = angle of t. 


To prove this result, apply 8.3.5 to write r = a—tac and s = u^a'—,, 

ab ah 

where D and E are appropriately chosen lines. Then sr = cjsitdi so the 

center of sr, and consequently the center of t = (sr)“*, is in the intersection 

D n E. We can thus write e = E n E. But then r = and s = 

at ac be ab 


and it suffices to apply 8.7.7.7. 


c 




9.3.7. Examples in dimension 3. In the case n = 3, corollary 9.3.3 
(iii) takes a simple and nice form, and can be independently proved using 
elementary means. A screw motion / with axis D, translation vector f and 
angle 6 6 ]0, rr] can be decomposed as the product of two codiinension-two 
reflections and ajj. Further, A and B are arbitrary lines orthogonal to D 
and intersecting D, satisfying the conditions d(A,B) = p/2 (where p — ||f|| 
is called the pitch of the screw) and AB = ^/2. This decomposition makes 
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for an easy study of the composition of screw motions, and yields numerous 
applications; see [FL, 338-339] and 9.14.38. 

Screw motions are naturally found in mechanics and in differential ge¬ 
ometry (for example as the first-order approximation to a differentiable curve 
in Is(X)); see 9.14.7 or [SY, 282). 

9.4. Polygonal billiards and the structure of plane 
isometries 


In this section X is always a Euclidean affine plane. 

In this section we utilize our knowledge of the structure of plane isometries to 
study a more elaborate example, involving two related problems: polygonal 
billiards and least-perimeter polygons inscribed in a given polygon. For the 
sake of readability, we deal first with the case of triangles, and then extend 
the discussion to polygons with arbitrarily many sides. The interesting thing 
is that the solution is radically different depending on whether the number 
of sides is even or odd. 

9.4.1. The case of a triangle. Let a,b,c be an arbitrary triangle in X. 

A triangle inscribed in {a, 6, c} is a triple of points q,/9, 7, where a 6 [6,c|, 
P € [e,a|, 7 € [a, 6] (cf. 3.4.3). The perimeter of {0,^,7} is -I- + 7a 

(cf. 10.3 or 12.3.1). The triangle {a,/3,7} is called a billiard trajectory, or a 
light polygon, if a € /3 € |e,a[, 7 € |tt,6| and each side of {o,6,c} is an 

exterior bisector of {a,/9,7} (this means that, in the terminology of 8.7.3.2, 

be is the bisector of and a$ and so on). The connection between billiards 
and least perimeters is afforded by the following 

9.4.1.1. Lemma. Consider, in a Euclidean plane, a line D and two points 
a and b lying in the same open half-plane determined by D (cf. 2.7.3). There 
exists a unique x & D such that ax bx is minimal, and x is characterized 

by the fact that D supports the bisectors of ax and xb. 

Proof. The classical trick is to introduce o' = cr£)(o). Then (a',b) meets D in 
a unique point x, and we have, for every y € D, 

ay by = a'y -b by > a'b = ax bx. □ 



Figure 9.4 1.0 


Figure 9.4.1.1 
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9.4.1.2. Note. The fact that the minimum satisfies the bisector condition 
ako follows from the first variation formula (9.10.5). 

9.4.1.3. Proposition. If the triangle {o, 6 , c} is acute (i.e., its angles 
lie in \0 ,'k/ 2]), it possesses a unique billiard trajectory, which is also the 
least-perimeter triangle inscribed in {a,b,c}, and is formed by the feet of the 
altitudes of {a,b,c}. 

If {o, 6 , c} has an angle > 7 r /2 at a, it possesses no billiard trajectory, 
but has a unique inscribed least-perimeter triangle {ct, 7 }, where a is the 
foot of the altitude from a and = 7 = a. 

9.4.1.4. Proof. Least-perimeter polygons always exist by compactness, since 
the perimeter function 

[ 6 , c] X [c, a] X [a, 6 ] 3 (q, 7 ) i-* a/9 -|- -I- 70 ; 6 R 

is continuous. Suppose the minimum is achieved for a 6 ] 6 , c[, P 6 ]c, a[ and 
7 6 ]o, 6 [. Then {a,/9, 7 } is a billiard trajectory by 9.4. 1 . 1 . Let A = (b,c), 
B = (c,a), C — (a,b) be the lines supporting the sides of {o, 6 ,c}, and 
introduce the reflections iTyi, og, oq. If {a,/ 9 , 7 } is a billiard trajectory, the 
composition / = acOB^A satisfies f{{oi,'))) = ( 0 , 7 )- But / £ Is“(X), 
so ( 0 , 7 ) is well-determined, being the axis of / (cf. 9.3.4). Thus a biUiard 
trajectory, if it exists, is unique. 

Now the axis of / is determined by the feet of the altitudes (check this), 
and the three feet lie in the interior of the corresponding sides if and only 
if the triangle is acute. There remains to show that in the acute case the 
billiard trajectory obtained has minimal perimeter, and that in the obtuse 
case the least-perimeter polygon is indeed the stated one. The latter is done 
by hand, observing that at least one of a ,7 must coincide with a vertex of 
{a, 6 , c} by the paragraph above. 








Figure 9.4.1.4 
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To treat the acute case, we unfold an arbitrary inscribed triangle q,/9, 7 
out on the plane as follows; consider the three isometries r = (TjI, s = ob'T, 
where B' = r(B), and t = crons, where C" = s(C). We find 

CTB' = Crr(Bj = C^aC^BC^a^ — 

so s = cr^CTBt similarly t = crA<^B<^c (so that t = /“^!). Set P' = r(P), 
7 " = 5 ( 7 ), a"' = <(«). Then 

aP + p-^ + 7a = aP' + P'')" + > at(a) 

by the triangle inequality. Applying 9.3.2, we see that the function x 1 —► xt{x) 
has its minimum on the axis D of t, which is the same as the axis of /. 
Conversely, if a belongs to this axis (and to |o, 6 ]) we easily see that a,P', 7 " 
and a.'" all lie on this axis, in the order named, so that aP' + P''^" + 7 "q'" = 
Qt(Q). □ 

9.4.2. Arbitrary polygons. We consider an n-sided convex polygon P 

(cf. 12.1) with vertices (ai)i=i.n And sides |oj, o,+i] supported by lines Di = 

{ai,ai+i) (where, by convention, n + 1 = 1 ). We generalize the notions intro¬ 
duced in 9.4.1; a polygon inscribed in P is an n-tuple of points (Qii)i=ri.„ 

such that Oj £ [ai,ai+i] for t = l,...,n. The inscribed polygon (Q!i)j=i.„ 

is a light polygon if Oj £ lai,ai+i( and ffc, ((oi, q,_i)) = (q,, a^+i) for all 
i = The perimeter of (a,) is defined as 

n 

P((“<)) = 

f=i 

9.4.2.1. Theorem. Any convex polygon has least-perimeter inscribed poly¬ 
gons. Any strictly inscribed least-perimeter polygon is a light polygon. Con¬ 
versely, any light polygon has minimal perimeter. 

Assame there exists a light polygon. If n is odd, the polygon is unique. 
If n is even, there are infinitely many light polygons. 

If n is even, a necessary condition for the existence of light polygons is 
that the relation £> 21 - 1^21 = 0 hold in the set A(X) of oriented angles 

between lines; in particular, if n = A, the vertices must lie on the same circle. 
A sufficient condition for the existence of a light polygon for n arbitrary is 
that there be a line D such that f(D) = D and D n 9 i(]oi+i, ai+ 2 [) 0 for 

all i = 1 ,..., n, where f and gi are defined below. 

Proof. The proof involves no new ideas; we just adapt the ones used in 9.4.1.4. 

9.4. 2 . 2 . Least-perimeter inscribed polygons exist by compactness. FVom 
9.4.1.1 we see that strictly inscribed least-perimeter polygons are light poly¬ 
gons. The converse follows from the second part of the theorem. 

9.4.2.3. We generalize the constructions in 9.4.1.4 by putting / = cr£)„ ■ ctbi 
and defining ff, by recurrence: 

9.4.2.4 

gi^Crc,, = O’wilD-lffl. - -I 9i + l = + (»=li- -i«)- 
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Since we have o'h(C) ~ h,<Tc'^ we see that 

9.4.2.5 

Qi = <^D, for all i; in particular g = gn = 

If (oj) is a polygon inscribed in (a,), we define points fit by recurrence, as 
follows: 

9.4.2.6 i9i = Qi, i92 = ffi(a2). •••, A+i = (i=l,...,n). 

The union U”=i[i®*i®»+il i® 3. broken line obtained by unfolding the inscribed 
polygon (ai). This broken line is a line segment if and only if (at) is a light 
polygon. 



®i fii Pi P* Si^i^Ps 



By construction we have p((aj)) = PiPi+i- By the strict triangle 

inequality we also have 

9.4.2.7 P((ai)) > at?(a,), 

and p((q,)) — aig(ai) if and only if the /3, lie on the same line, in the right 
order. 
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9.4. 2.8. Assume that (oj) is a light polygon. The line D = (q„,Qi) is 
invariant under /. Thus, if n is odd, / € Is“(Ar) and D is the axis of /, well- 
determined by 9.3.4. This shows that light polygons are unique when n is odd. 
(Non-uniqueness for n even will be shown at the end of the proof.) When 
n is even the condition f(D) — D implies, by 9.3.4, that / is a translation, 
or, equivalently, $”'^(/) = 0 in A{X). Writing / as a product of pairs of 
reflections, 

and applying 8.7.7.8, we obtain the desired result ^ 2 i~iD 2 i — 0- Co¬ 

cyclicity for n = 4 follows from 10.9.5. 

9.4.2.9. Now assume that the conditions in the last assertion of the theorem 

are satisfied. FVom the convexity of (a,) we conclude that the points = Dn 
ffi(]oi+i, ai+ 2 [) disposed on £) in the right order, so the inscribed polygon 
(oi) defined by Qj = for i = 1 ,..., n has perimeter p{(o,)) — Pig(Pi) 

(cf. 9.4.2.7). Conversely, the perimeter of an arbitrary polygon (a(), again by 
9.4.2.7, is equal to but 9.3.2 and 9.3.4 show that p((a|)) > p((“*)) 

always, whether n is odd (in which case / belongs to Is“ (AT) and has a unique 
axis) or even (and / is a translation and leaves invariant all lines parallel to 



Figure 9.4.2.9 
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the direction of translation}. We conclude that the polygon (oi) has minimal 
perimeter and is a light polygon. This also shows that every light polygon 
has minimal perimeter, since the line D — (ai,a„} satisfies the conditions at 
the end of the theorem, by definition of the gi- 

9.4.2.10. To conclude, assume that n is even and (cti) is a light polygon. The 
line D = (qi,q„) enjoys the right intersection properties with the segments 
Pidoi+i, Oj 4 . 2 [); by continuity, all lines D' parallel to D and sufficiently close 
to it enjoy the same properties and satisfy f(D') — D, since / G T(X). □ 

9.4.3. Remarks. The somewhat annoying fact that light polygons are 
not always to be found (cf. 9.4.1.3) occurs for every value of n; see 9.14.10 
and 9.14.33. When interested in finding a least-perimeter polygon for an 
actual problem, one should first construct / and apply 9.2.4.1 to check for 
the existence of light polygons. If they do not exist, one of the Qi will be 
a vertex of P, and one has to carry out a case analysis by hand, applying 
9.4.1.1 systematically. 

One can extend the above study to billiard trajectories that close only 
after two or more turns (cf. 9.14.9). Another possible direction is to look for 
light polygons in arbitrary convex compact sets of the plane. The situation 
is nicer if the set is strictly convex (cf. 11.6.4); then there exist light polygons 
with n vertices for every n > 2 (see 9.14.33 for a proof). The case of an 
ellipse is particularly impressive; we shall see in 17.6.6 that not only do light 
polygons exist for every n, but one of their vertices can actually be chosen 
arbitrarily on the ellipse. On the other hand, if the set is not strictly convex, 
the trajectory of a light ray may leave the set. 




Figure 9.4.3 


9.4.4. ErgODICITY. An interesting problem, currently under intensive re¬ 
search, is the ergodicity of polygonal or compact convex billiards. A billiard 
trajectory is determined by an initial point and direction: the ball follows 
this direction until it hits the boundary, then bounces off in the direction 
prescribed by the equality between the incidence and reflection angles, and 
so on forever. Some trajectories may hit the boundary at a vertex, where 
the reflection direction is not well-determined, but this does not matter, as 
ergodic theory proposes to study the set of trajectories only up to a set of 
measure zero. A compact convex set is called weakly ergodic if (almost) all 
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Figure 9.4.4 
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trajectories are dense in the set, and strongly ergodic if all trajectories are 
dense in the product of the set and 5* (where the second component gives 
the direction at each point of the trajectory). This problem is in some sense 
opposite to the search for closed trajectories, undertaken in 9.4.2. 

Some convex billiards are completely understood: circles and ellipses are 
never weakly ergodic (trivial for the circle, a consequence of 17.6.6 for the 
ellipse). Some triangles are not strongly ergodic (9.14.11). On the other hand, 
a square is certainly weakly ergodic, since trajectories whose directions have 
irrational slopes are dense, but not strongly ergodic, because each trajectory 
has only two slopes, opposite to each other. 

In other cases the situation is much more complicated. Even for arbitrary 
triangles with given angles, for example, it is an open question whether er- 
godicity holds. In order to glimpse the complexity of the problem, the reader 
can just try to follow a. trajectory; after a number of bounces, the situation 
quickly seems to get out of hand. Recently Lazutkin |LZ] showed that strict 
convex sets with boundary are never ergodic; this is a consequence of the 
existence of caustics, curves that generalize the homofocal ellipses tangent to 
trajectories in elliptical billiards. 

For references on the ergodicity of billiards, see |A-A], |CZ], [ME], [SI]. 

9.5. Similarities 

Consider / € GA(Ar) such that f € GO(J?) (cf. 8.8.2). Writing n for the 
ratio of /, we have f(x)f(y) = fixy, and f(x')f(y') = fix'y', whence 

f{x‘)f[y') _ x'y' 

f{^)f(y) 

whenever x ^ y. In other words, / preserves ratios between distances. The 
converse also holds: 

9.5.1. Proposition and definition. Let f . X X be a non-constant 

f(x‘)f(y') x'y' 

set-theoretical map such that ■ = - whenever x ^ y and fix) ^ 

Hx}f(y)^ xy 

f(y). Then f G GA(X) and f G GO(X). Such maps are called similarities 
of X, and they are orientation-preserving or reversing according to whether 
f G GO'^(X) or f € GO'"(X). The set of similarities (resp. orientation¬ 
preserving, orientation-reversing] similarities of X is denoted by Sim(X) 
(resp. Sim'^(X), Sim~(X)j. The ratio of f is defined as the ratio of f. 

Proof. Take distinct points xo,yo such that /(xq) ^ /(Sto)- The hypothesis 
implies that / is bijective and f(x)f(y) = pxy for every x,y € X, where 

p is defined hy p = If we now take an arbitrary homothety h 

xoyo 

of ratio p~ ^, the composition ho f is an isometry of X, whence, by 9.1.3, 

ho/ G GA(X) and ho f ^ 0[X). The conclusion follows by composing again 
with h~^. D 



220 


Chapter 9. Euclidean afline spaces 


9.5.2. Proposition. Consider f G Sim(X) \Is^X). There exists a unique 

Cl) €: X such that f (cl>) = cj; this point is called the center of the similarity f. 
We have a decomposition f — ho g — g o h, where h 6 and g 6 Is^, (X). 

Proof. This follows from 9.3,3. There is also a topological proof, using con¬ 
tracting maps—obviously either / or its inverse are contracting, depending 
on whether the ratio of / is greater or less than one. □ 

9.5.3. Characterization of similarities 

9.5.3.1. Let / be a similarity. FVom 9.2.1 and 8.8.5 .1 it follows that / 
preserves angles between lines and half-lines and that, in the two-dimensional 
case, / preserves or reverses oriented angles depending on whether it preserves 
or reverses orientation. In particular, / preserves orthogonality between lines: 

D±D' => /(D) ± f(D'). 

Furthermore, the image under / of a sphere S of X (cf. 10.7 if necessary) 
is also a sphere (with radius fj, times bigger). Either of these two properties 
characterizes similarities: 

9.5.3.2. Theorem. Let f be a set-theoretical bijection of X, where dimX > 
2. The following conditions are equivalent: 

i) / is a similarity; 

ii) if a, b, c, d are points of X satisfying a b, c ^ d and {a, b) ± (c, d), the 
images satisfy (f(a),f(b)) ± (/(c), /(d)); 

iii) the image f(S) of any sphere S of X is also a sphere. 

9.5.3.3. Proof. Assume first that (ii) is satisfied. In view of 2.6.5 and 
8.8.5.1, it is enough to show that / takes coDinear points a,b,c into colli- 
near points /(a), f(b), f(c). To do so, take vectors 02 ,...,o„ such that 
{a, 6 , 02 ,..., a„} is an orthogonal affine frame; by assumption and 8 .1.2.5, 
{/(®)i/(^))/(® 2 ), • • • 1 /(®n)} is an orthogonal affine frame. But 

(/(a)>/(c)) -L (/(«)./(«.)) 

for all i = 2 ,..., n, showing that (/(o), /(fc)) = (/(o), /(c)). 



9.5.3.4. Now assume that (iii) is satisfied; we start by showing that / takes 
collinear points into collinear points. This is easy in the opposite direction; if 
o', b', c' are distinct collinear points, the points o = f~^(a‘), b = f~^(b') and 
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c = f~^(c') axe collineax, else we could consider a sphere 5 containing a,b, c 
and its image f(S) would contain the collineax points a', b', c', a contradiction 
(see 10.7.2 if necessary). In the forward direction, take distinct collineax 
points a,b,c and assume that their images o' = /(o), b' = f(b) and c' = 
/(c) do not satisfy c' G (o', 6'). By the previous argument /~^((o', i>')) C 
(o, fc). Put P' = (o', 6',c'} and let d' be an arbitrary point in (o',fc'); then 
((c', d')) C (o,c) = (o, 6), so f~^(P') C (o,6). Now let 5 be a sphere 
containing o and 6; we have 5 n (o,fc) = {a, fc), and C = f(S) n P' is 
a circle because f(S) is, by assumption, a sphere. But this would imply 
/“*(C") C (o,fc), which is absurd, since a circle must contain at least three 
points. 





9.5.3.5. After an application of 2.6.5, there remains to show that a map 
/ G GA(X) that takes spheres into spheres is a similarity; in fact, it is 
enough to assume that there is one sphere whose image is also a sphere. 
After composing / with a dilatation, we can assume that there is a sphere 
5 such that f(S) = S. We first show that / fixes the center w of 5. This 
is done by considering two diametrically opposed points o, fc on 5, and the 
hyperplanes and Hi,, tangent to 5 at a and b. Hyperplanes tangent to a 
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sphere axe characterized by intersecting it in a single point, and this condition 
is preserved by /; thus and f(Hf,) are the hyperplanes tangent to 5 at 

/(o) and f(h). Now two points on S are diametrically opposed if and only if 
the tangent hyperplanes at these points are parallel; since parallelism is also 
preserved by /, we conclude that f{Ha) and are parallel and f(a),f(b) 

are diametrically opposed. Using the fact that / is affine, we obtain 

as desired. 

This brings us to the vector case, / G GL(Ar„); but then saying that 
f(S) = S is equivalent to saying that / preserves the norm of X^, whence 
/ G 0(Xu), by 8.1.5. □ 



9.5.3.6. Note. If one weakens the assumptions in 9.5.3.2, requiring that / 
be a bijection between two parts of X only, condition (iii) is not sufficient to 
guarantee that / is the restriction of a similarity. For example, / can be an 
inversion (10.8.2) or a composition of inversions (18.10.4). For a discussion 
of bijective maps / between subsets of affine spaces that take spheres into 
spheres, see [CD] and [G-Wj. 

9.5.4. Liouville’s theorem 

9.5.4.1. Liouville’s theorem provides a characterization of similarities in 
terms of differential geometry, but its discussion is relevant to other parts of 
this book as well; 10.8.5, 18.10, 20.6. 

We start from a similarity / of X, considered as a C°° map from X 
into itself. Every morphism / G A(Jf; X) is of class C°° and its derivative 
f : X —* L(X-,X) is just the constant map x h -* /, that is, f'(x) = / for 
all X G X (this follows from 2.7.7). In the particular case / G Sim(X), we 
have /'(x) = / G GO(.Y) for all x € X; this can be expressed by saying that 
/ preserves infinitesimal angles. The question arises whether there are other 
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maps with the same property. In dimension one the question is trivial, since 
every map with nowhere vanishing derivative satisfies the property; we thus 
restrict our analysis to the case n > 1. 

9.5.4.2. Definition. Let U and V be open subsets of a Euclidean affine 
space X. A map f G C^(U-,V) (that is, a map f : U —* VJ is said to be 
conformal if f is bijective and f'{x) G GO(.Y) for every x € U. A confor¬ 
mal map f is orientation-preserving (resp. reversing ) if f'{X) G GO"''(X) 
(resp. GO'“(X)). The corresponding sets are denoted by Conf({7;V) and 
Conf*({7;V); we also write Conf(£/) and Conf^({7) for Conf({7;{7) and 
Conf*(V; U). 

It is easy to see, using injectivity, the theorem of invariance of domains and 
the fact that GO(X) C Isom(A’; X), that a map / G Conf({7; V) is necessarily 
a diffeomorphism onto its image. 

9.5.4.3. First example: holomorphic functions 

Assume that X has dimension n — 2, and identify it with an affine complex 
line (cf. 8.3.12, 9.6.4). A function f ■. U X, where U is an open subset of 
X, is called holomorphic if it has a complex derivative at every point; this 
notion is well-defined because, even though we have to orient X to make sense 
of it, the choice of orientation does not affect the outcome. A classical result 
states that / G Conf"*"(tf;/(t/)) if and only if / is injective, holomorphic, 
and its derivative is never zero. The proof follows from 8.8.4 .1 (see |CH2, 67] 
if necessary). 

We thus obtain a great number of infinitesimal similarities which are 
not global similarities, because injective holomorphic maps on open sets U 
with nowhere vanishing derivative are very abundant. The space of such 
maps, in fact, cannot be parametrized in any reasonable way by finitely many 
parameters. 

9.5.4.4. Second example: inversions 

We shall see in 10.8.5 that an inversion / with center o and arbitrary power 
belongs to Conf(X \ a). The composition of a finite number of inversions 
is thus an element of Conf ({7; /({/)), where U is equal to X minus a finite 
number of points. Again, these maps are not global similarities. 

9.5.4.5. The number of degrees of freedom in the choice of inversions is finite, 
and we shall see in 18.10.4 that it is equal to ((n + 2)(n + l))/2. 

The next theorem shows that, for infinitesimal similarities, there is a funda¬ 
mental difference between the cases n = 2 and n > 3; the theorem also shows 
that there is a radical difference between the “local” case (the open set U is 
a proper subset of X) and the “global” case (U = 7f). See 12.8, 16.4 and 
18.3.8.6 for other examples of passing from local to global. 

9.5.4. 6 . Theorem. For n = 2 we have Conf(7f) = Sim(X). For arbitrary 
n, a C* map f G Conf(X) belongs to Sim(7f). If n > 3, U is an open set of 
X and f G Conf (U;/(f/)) ts of class C*, then f is the restriction to U of a 
product of inversions of X (and so must be C°°J. 
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9.5.4.7. Note. The equality Conf(Jf) — Sim(X) actually holds for all n 
(that is, the C* condition in the theorem could be replaced by C*), but the 
proof is much more difficult, resorting to fine analytical techniques (cf. [HM]). 
The proof given below for the case n > 3 is due to R. Nevanlinna [NA]; see 
9.5.4.21. See also 9.14.45 and [LF3, 59, exercise 12). 

9.5.4.8. The case n = 2. After vectorialising and orienting X and identi¬ 
fying it with C, we have a map / : C —» C, injective and of class C*. Since 
C is connected, we have either /'(z) G GO''‘(C) or f'(z) G GO~(C) for all 
z G C. After composing / with a conjugation 2 —♦ z, we can assume we are 
in the first case. By 9.5.4.3 the map / ; C —» C is holomorphic and injective, 
and, by a classical result, must be of the form z az + b, for some o, fc G C, 
and hence a similarity (see 9.6.4 if necessary). (The result quoted can be 
found in [CH2, 181-182); the proof is nice, but it uses the notions of essential 
singularities and meromorphic functions.) 

The proof for n > 3 uses, among other thing, the following famous lemma; 

9.5.4.9. Braid lemma (see also 1.9.14). Let V and W be vector spaces 
over a field of characteristic ^ 2, and k : V xVxV —a trilinear map, 
symmetric on the first two variables and skew-symmetric on the last two. 
Then k — 0. 

This lemma is often pivotal in differential geometry, as exemplified by this 
case; see also [SB, 333] and [KO-NO, volume 1 , p. 160). The name comes 
from the fact that its proof mimics the first six steps in making a braid: after 
six crossings, three of the first two strands and three of the last two, we’re 
back in the original situation; but swapping the last two variables three times 
switches the sign of the result, leading to the desired conclusion. □ 



kix. y. z) - 

kiy. X. z) = 

kix. y.z) - 

k{y. z. x) = 

— k{x. y. z) _ _ 

k{z. y. jc) = 

- kix. y, z) - 

k(z. X. y) = 

kix. y. z) - 

k(x. z. y) = 

kix. y. z) - 

k{x. y. z) = 

— kix. y. z) = 0 . 



Figure 9.5.4.9 
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9.5.4.10. By assumption, f'(x) G GO(.Y) for aJl z G {/. Denoting by /i(z) 
the corresponding similarity ratio, we have 

9.5.4.11 (/'(z)(u) |/'(z)(u)) = foraUu,uGX. 

Let u, V be fixed orthogonal vectors; we have (/'(z)(u) | f'{x)(v)) = 0, and 
differentiation gives, for all tu G .Y, 

(/"(z)(u, w) I /'( 2 )(w)) + (/'(a:)(u) | f"{x)(v, u;)) = 0. 

The braid argument and the symmetry of f" show that (/"(z)(u, u) | 
f'(x)(w)) = 0 for any w orthogonal to tt and v. But for such a w the 
vectors /'(z)(u), f'(x)(v) and f'(x)(w) are orthogonal, and consideration of 
a basis of the orthogonal complement of Rit+ Ru leads to the conclusion that 
/"(u,u) gR/'( x)u + R/'(z)»;. Thus there exist two functions a,P :U —* R 
such that 

9.5.4.12 /"(^)(«^,«) = 

for every z G f/ and fixed orthogonal vectors u,v. 



Upon differentiation of ||/'(r)(w)|P = ^^(z)||u||^, we obtain 
(/"(z)(u,v) I /'(z)(u)) = ;i'(z)(r;)/x(z)||u||^, 
and substituting this into 9.5.4.12 gives 


9.5.4.13 


H(x) fi(x) 


We now put p = ^ and combine 9.5.4.12 and 9.5.4.13 into 


9.5.4.14 p'(z)(v)/'(z)(u) + p'(z)(u)/'(z)(r;) + p(z)/"(z)(u, v) = 0 

for every z G 17 and fixed orthogonal vectors u, v. 

9.5.4.15. After catching our breath, we differentiate again, this time formula 
9.5.4.14, obtaining 

p"(x)(v,w)f'(x)(u) + p'{x){v)/"{x){u,w) + p"{u,w)f'{x}{v) 

+ p'(x)(u)f"{x){v,w) + p'(w)/"{x){u, v) + p(x)f'"(x)(u, v,w) = 0. 

We verify that the sum of the last five terms is symmetric in u and w, so the 
same must hold for the first term: 


p"(x)(v, w)f'(x){u) = p"(x)(v, u)f'(x)(w) 
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for all orthogonal vectors u,v,w (since p itself does not depend on u and 
d). But the two vectors f'(x)(u), f'(x)[w) are linearly independent, so that 
p"(x)[u, u) = 0 for all orthogonal pairs it, v because n > 3. 

By the proof of 8.8.5.1 (and here infinitesimal arguments do correspond 
to reality), p"(x) must be proportional to the Euclidean structure. In other 
words, there exists a : 17 —» R such that 

9.5.4.16 p"(2:)(u, w) = i7(2:)(u I u) 

for all a: G 17, It, u G X. The factor cr is in fact constant; to see this, we 
differentiate 9.5.4.16 (which we can do because / is C*) to get 

p'"(x){tL, V, w) = o''(2:)(ti;)(u | u). 

The second term is symmetric in v and w because p'" is symmetric; we thus 
have ((t'(z)(u;)u — (T'( 2:)(u)tu | u) = 0 for all u, so o''(2:)(iu)u = a'(x)(v)w and 
(7 = 0 because v and w are linearly independent. 

9.5.4.17. The equation p"(u,v) = 17(11 | v) can be integrated by inspection, 
yielding 

9.5.4.18 p[x) = a||a:o2|p + h, 

where a, h are constants and iq G AT. If o = 0, /i is a constant and differen¬ 
tiating 9.5.4.11 gives /" = 0, so / is affine and must be the restriction to U 
of a similarity. If 6 is zero, we observe that xo ^ U and compose / with an 
inversion t of pole Xo- The composition to f is still an infinitesimal similar¬ 
ity, and we have pio/{x) = pi(^f(x))pf(x). But the factor pi for an inversion 
with pole Xo is ||xoxp up to a scalar multiple (cf. 10.8.5.1, the power of an 
inversion), so the factor p,o/ is constant, and i o f is a similarity restricted 
to U. Notice that the case 6 = 0 is excluded if t7 = Tf, so an infinitesimal 
similarity f : X —> X of class C* in dimension > 3 is a global similarity. 
9.5.4.19. To conclude the proof, we will show that the case a ^ 0, b ^ 0 
cannot occur. Let / be the inverse of f : U —‘ J[U). This diffeomor- 
phism is still an infinitesimal similarity and, as observed in 9.5.4.18, we have 
(/(x))/zy(x) = /i„o/ = 1 for any x G t7. By 9.5.4.18 there exist constants 
c, d such that 

9.5.4.20 (a||x^^|p + 6) (c||/(xo)/(x)|p + d) - 1 

for all X G 17. This shows, to begin with, that / maps subsets of spheres of 
center Xo contained in U into subsets of spheres of center /(xo); next, that / 
maps a segment of U whose support contains xq into a segment of f(U) whose 
support contains /(xq). Fixing u € X with ||u|| = 1, we obtain a real-valued 
function 4> defined by the equality 

/(xo + tu) = /(xo) -t- 

where the unit vector v and the interval of variation of t are appropriately 
chosen. By 9.5.4.11 we have = (of^ -t- &)~*i and also (ot^ -|- fe)(c(^"(t) -|- 
d) = 1, and the two relations can only coexist if either o or 6 vanish. □ 
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9.5.4.21. Note. Liouville’s original demonstration was quite unexpected, 
using facts apparently unrelated to the theorem, namely: a system of three 
mutually orthogonal surfaces in dimension three intersects along curvature 
lines (Dupin’s theorem); and spheres are characterized by the fact that all 
their points are umbilical, that is, they have many more curvature lines than 
ordinary surfaces. Liouville finished off with 9.5.3.2 (iii). 

9.5.5. The umbilical locus and the cyclic points. We study here 
the affine analogue of 8.8.6. Using the notation of 7.6, we denote by X'' the 
complexified projective completion of our Euclidean affine space X, We have 
X^ ' = X^' U oox<', with oox< = P(X''). As usual, p ; X^ -*P(X^'') denotes 
the projection from a vector space onto the associated projective space. By 
8.8.6 and using the terminology of chapter 14, the quadratic form defined 
on oox*- = P(X'') is a quadric with image fl = If n = 2, fl 

consists of two points, and if n = 3, fl is a conic (cf. 14.1.3.7). 

9.5.5.1. Definition. The subset Q of oox<'- is called the umbilical (locus) of 
the the affine Euclidean space X. For n — 3, Q is a conic, and for n = 2, 
fl consists of two points, called the cyclic points of X and denoted by {/, Jj. 
Choosing one of the two cyclical points is equivalent to orienting X, according 
to the convention stated in 8.8.6.2. 

There is no abuse in identifying the isotropic lines /, J with the points to 
which they give rise in the projective space. 

We now combine 8.8.6.4 and 5.2.2: 

9.5.5.2. Proposition. A necessary and sufficient condition for f G GA(X) 
to be a similarity is that /^" (fl) = fl. If f = 2, f €: Sim'^(X) (resp. f G 
Sim~(X)^ if, in addition, ff' fixes (resp. switches) the two cyclic points. □ 

9.5.5.3. Example. Orthogonality between two lines D, D' of X can be 
translated by 

D ± D' <=» [oopc, oo/j/f:, /, Jj = -1, 

for switching I and J does not alter the cross-ratio in the case of a liaiinonic 
division (cf. 8.8;7.4). 
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9.6. Plane similarities 


In this section X stands for an oriented affine plane. If X is not oriented 
to begin with, we choose an arbitrary orientation for it. 


This section consists mostly of elementary examples with visual appeal, after 
the first five paragraphs, which contain a summary of properties and classical 
expressions for plane similarities. 

9.6.1. Structure. A map / G Sim'*‘(A’) is either a translation or the 
product, taken in either order, of a rotation and a homothety with same 
center w, where w, the center of /, is the unique fixed point of /. A map 
/ G Sim~ [X) is either an isometry, in which case it is a reflection through 
a line followed by a translation, or the product, taken in either order, of a 
homothety of center u> and a reflection through a line cont^ning u), where u) 
is the unique fixed point of /. 

This follows immediately from 9.3.4 and 9.5.2. 

9.6.2. Simple transitiveness. For any four points a,b,a',b' of X with 
a ^ b and a' ^ b', there exists a unique / G Sim'^(X) such that /(o) = o', 
f(b) = b'. 

To see this we use a homothety to reduce the problem to the case of an 
isometry, ab = a'b', and then apply 9.1.6. 

Given a,b,a',b', it is interesting to try to construct geometrically the 
center ut of the unique / taking a to o' and b to b'. The solution conskts in 
finding the intersection point e = (o, 6)n(o', &'), drawing the circles containing 
o, o', e and b, b', e, and tcikiiig the intersection w of the two circles other than 
e (unless the circles are tangent at e). This construction is a consequence of 
9.5.3.1 and 10.9.4. See also 9.14.14. 



Figure 9.6.2 
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9.6.S. Angle of an orientation-preserving similarity. We decom¬ 
pose / £ Sim'^(X) as in 8.8.3 and assign to it the angle of the associated 
rotation. Denoting this angle by a, we have A/(A) = a for every oriented 
line A. 

9.6.4. Plane similarities and complex numbers. The fundamental 
remark here is the affine counterpart of 8.3.12 and 8.8.4. Let X be an oriented 
affine plane; then X has a natural complex line structure. For this natural 
structure, we have GA(Ar) = Sim'*’(AT). In the coordinates associated with 
an arbitrary complex affine frame of X, every / G Sim'*' (X) is of the form 
z az + b and every / G Sim~(Ar) is of the form z az + b. The angle 
of / G Sim'*' (A") (cf. 9.6.3) is simply the argument of the complex number a 
(cf. 8.7.8.4), and the modulus of / is the absolute value |a| of o. 

The discussion about simply transitiveness (9.6.2) and all practical cal¬ 
culations in Sim'*' (A) are facilitated by writing / i z az + b. 

9.6.5. Cross-ratio of four points. Since A is, in a natural way, a 
complex affine line A, we can complete it into the complex projective line 
A = A U oojf (cf. 5.1.3). (Notice that A should not be confused with A*", 
introduced in 9.5.5. The latter is a complex projective plane, with a line at 
infinity, while the former is a complex projective line, with only one point at 
infinity.) The cross-ratio of four points of A is their cross-ratio as points of 
Al it is a number in C U oo, and is finite if the four points are distinct. The 
connection between this complex cross-ratio and the Euclidean structure of 
A is very nice; 

9.6.5.1. Proposition. If z = [a,6,c,d], where a,b,c,d G A, the absolute 

ac/be 

value of z is determined by the distances: \z\ = - ^ 7 - ; and the argument of 

ad/bd 

z is determined by the oriented angles: arg[z) — ca, cb — da, dbinA(X). 

9.6.5.2. This result has many elementary consequences, which we do not 
enumerate exhaustively; we mention but two of them. The first refers to 
harmonic quadrilaterals, i.e., quadruples of points of A whose cross-ratio b 
— 1 (cf. 9.14.15). The second b that z = \a,b,c,d] b real if and only if its 
argument is 0 or ct; thb is equivalent, by 8.7.7, to the condition (c, a), (c, b) = 
{d,a), (d,b) in A(X)', but we shall see that this is exactly the condition that 
a,b,c,d be cocyclic or coUiiiear. It follows, for example, that homographies 

az -j- b » 

z h—► —~j—^ of map circles into circles (or lines). This result will follow 
again in a more general context (chapter 20; see 18.10.4 and 20.6). 

9.6.6. An elementary construction. The problem b to find a circle 
through a given point x and tangent to two given lines D, D' . The case D || D' 
is left to the reader. We put a = D H D', and draw any circle C tangent to 
D and D' and contained in the convex set defined by D, D' and x. The idea, 
developed in figure 9.6.6, b to find the homothety of center x which maps C 
into a circle containing x. 
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9.6.7. The principle of similar figures. We give as motivation for 
this principle the following question: Given two distinct lines D,D' of X and 
points m, m' on D, O' , respectively, each of which moves with constant speed, 
what is the envelope of the line mm'1 

9.6.7.1. Let / be the orientation-preserving similarity such that f(m) = m' 
and }(n) = n', where m and n are two positions of the point that describes 
D and m' and n' are two positions of the point describing D' (cf. 9.6.2). 
Then, if p and p' are two further positions of the points on D and D', we 
will still have /(p) = p', since the condition of constant speed and the fact 

that / € Sim'^(Jf) are both equivalent to —— = (cf. 2.4.6). What we 

m'n' 

need now is to show that every point linked to m and m' by an orientation¬ 
preserving similarity also describes a line. This is where the principle comes 
in: 

9.6.7. 2. Principle. Let f be an orientation-preserving similarity of X and 
m" a point linked to the pair (m, m'), where m' = f(m), by an orientation¬ 
preserving similarity (this expression means that there exists a similarity g £ 

GO'^(X) such that mm" = g'(mm') for every m €: X). Then m i—> m" is 
an orientation-preserving similarity. In particular, if m describes a line, a 
circle, etc., so does m". 

Proof. We use 9.6.4. Working in an arbitrary frame, we have /(a) = az b 
and 5 f(u) — cu, where a, 6, c 6 C. Then 

m" = m mm" = z c[f[z) — z) = z -f- c(oz -t- & — z) = (oc — c -|- l)z -|- be. 

But the map z (ac — c -f- l)z be is obviously an orientation-preserving 
similarity. □ 

Returning now to the original problem, we see that the foot h of the per¬ 
pendicular dropped from a to the line (m,m') is linked to (m, m') by an 
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orientation-preserving similarity, since a is the center of the similarity m i-+ 
m'; thus m describes a line E and the envelope of the line (m,m') is the 
parabola with focus a and tangent at the vertex equal to E (cf. 17.2.2.6). 

For other applications of the principle, see 10.13.18. 

9.6.8. Double pedal curve of two circles. The pedai (curve) of 
a curve C, relative to the point a, is the set of points m €: X for which 
there exists a tangent to C containing m and orthogonal to the line (o, m) 
(figure 9.6.8.0.1). The double pedal of two curves C,C' is the set of points 
m G X for which there exists a tangent to C and a tangent to C intersecting 
orthogonally at m (figure 9.6.8.0.2). 
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Pedals of the circle are called Pascal lima^ons (their eponym is the father 
of the better-known Blaise Pascal) and occur in many situations, if for no 
other reason because they are among the simplest curves of fourth degree 
(bicircular quartics). Figure 9.6.8.0.3 shows their possible forms. The one 
curve displaying a cusp is called a cardioid; there is a discussion of it in 
9.14.33 for those who find it attractive. In 9.14.22 we study the general case. 
The particular case where the tangents at the intersection point form an angle 
of 2n/3 can be found in [DQ, 169, exercise 77], 





Figure 9.6.8.0.3 

In the case of double pedals, it is easy to see that if the two curves are 
closed, regular and of class C‘, the double pedal can be naturally decomposed 
into two curves x, x', according to the orientation of the tangents deriving 
from a fixed orientation of C,C'. But this geometric decomposition does not, 
in general, imply an algebraic decomposition. We shall see that if C,C' are 
circles there is really an algebraic decomposition, and x, n' are the pedals of 
C, C with respect to appropriately chosen points (figure 9.6.8.0,4). 

9.6.8.1. We assume that the circles C,C' are not concentric, otherwise the 
problem is trivial. Observe first that there exist exactly two orientation¬ 
preserving similarities /i and /2 that rotate by a right angle and take C into 
C (cf. 8.7.3.5 and 9.6.3). To see this, take m € C and diametrically opposite 
points m[,m 2 G C such that, denoting by a,o' the centers of C,C', we have 
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Figure 9.6.8.0.4 
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am ± o'm'|. For /i and /a take the orientation-preserving similarities such 
that fi(a) = o' and fi(m) — m[ for t = 1,2; such similarities exist and are 
well-defined by 9.6.2. It is clear that fi(C) = C because fi(C) is the circle 
of center f(a) = o' and radius equal to the radius of C■ 



Now let D = TmC, D' = be the tangents to C,C' at m,m'j, 

respectively, set a: = D n £)', and let h be the projection of wi, the center of 
fi, onto D. V/e shall use the following trivial lemma (figure 9.6.8.2); 

9.6. 8 . 2. Lemma. Let {o, 6, c} and {a',b',c'} be two right triangles, (that is, 
such that {a,b) 1 (a,c) and (o', 6') _L (o', c')/ There exists an orientation¬ 
preserving similarity f satisfying f(a) = o', f(b) = b', f(c) = c' if and only 
if the following equality holds in 

(6,o')7(6.c)= (6'.o'M&'.c'). 

Furthermore, f is uniquely determined by the value of this oriented angle 
between lines. □ 

9.6.8.3. Applying this lemma, we see that the angle (m'j, wi), (m'j,m) is 
well-determined by /i; we call a this element of A{X), which depends only 
on C, C and the choice of fi- By 10.9.5, we have a = {xm^,u)i), {m\, m) — 
(x, m), {x,wi) because the four points (j}i,m,m\,x are cocyclic, since 


(wi,m),(a;i,mi) = (i, m),{x,m[) = 6 
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(cf. 8.7.7.4 and 10.9.5). Thus {x,h),{x,u)i) = a, and another application of 
the lemma shows that x is deduced from h by a fixed similarity gi- Since 
h describes the pedal ni of C with respect to u)i, the point x must describe 
^nd the conclusion is that the double pedal of C and C is made up 
of and the analogous curve ff 2 (^ 2 )- 


C’ 



Figure 9.6.8.2 


9.6.9. EaDEM MUTATA RESURGO. We discuss here a property of logarithmic 
spirals that impressed Jacob Bernoulli to the extent that he requested that 
such a spiral, together with the above Latin quotation (loosely translated as 
“Here am I again, transformed”), be engraved on his tombstone in Basel (see 
9.14.32). The property is that a logarithmic spiral is globally preserved by 
certainly similarities. This can be easily explained by the fact that Sim,J (Jf), 
for a fixed u € X (or, alternatively, Sim'''(.^)) contains non-trivial subgroups. 
9.6.9.1. For fixed k 6 R+ cind w € X, introduce the set 

G = {/(t) = o0-»(A(t)) I t e R} c Sim+(A-), 

where H^.kt is the homothety of center w and ratio kt, and (A(t)) denotes 
the rotation of center u) and angle A(f). We see that G is a subgroup of 
I Sim„(X); it is homeomorphic and isomorphic to the additive group R, and 
we call it a oiie-parameter subgroup of Sim^(X). A logcwithmic spiral of pole 
u) is an orbit of G (distinct from {a;}), for an arbitrary k. 

By definition, logarithmic spirals are invariant under G. The image of a 
logarithmic spiral under a similarity is still a logarithmic spiral. Every curve 
linked to a logarithmic spiral C and its pole a> by an orientation-preserving 
similarity is still a logarithmic spiral. An example is the pedal of C with 
respect to u), as can be seen by noticing that the tangent TmC to G at m 
makes an constant angle {m,u)),TmC € A(X) with the line This is 

shown in the following way: since G acts transitively on G, by definition, 
there exists / 6 G such that f{m) = n for any m, n G G; in particular, 

(m,i:^T,nC={f{m),f{::^),f(T,r,C) = {n,^Tr.C. 
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Figure 9.6.9.1 

For a converse and other properties of logarithmic spirals, see exercise 
9.14.21 (or Bernoulli’s tomb). For examples of logarithmic spirals in nature, 
see |WL, 69-72]. 

9.7. Distances between several points 

9.7.1. Proposition. Let (3ti)i=o.i.« affine frame of X. If two 

points x,y S X satisfy XiX = Xiy for all t = 0,1,...,n, we have x = y. Let k 
be an integer, and (li), (y,-) (t = l,...,k) subsets of X such that XiXj = j/ij/y 
for all i and j; there exists f 6 Is(Jf) such that f{xi) = yi for all i. 

Proof. Vectorialize X at lo. By 8.1.2.4, we have (i, j x) = (a:,- j y) for all i. 
But the linear forms (a:,- | ■) are linearly independent for i = 1,..., n because 
(aii) is an affine frame; this shows that x = y. 
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By the first part, it is enough to show the second for k < n+1 and affinely 
independent subsets (a:,-), (j/,). In fact, we can assume fc = n + 1, since every 
isometry / 6 Is(y, Y') between subspaces of X can be trivially extended to 
an isometry / of X. We will argue by induction on n = fc — 1. For n = 1 
we apply 9.1.6 (or we can start from ra = 0, which is trivial). Suppose our 
conclusion holds for n, and let Y = {xo,xi,...,x„), Y' = (yo,yi,---,yn) be 
the subspaces spanned by each set of points. There exists g 6 ls(X) such that 

g{Y') = Y. Applying the induction assumption to the subsets (a:i)i=o.i. n 

and j ^ of the n-dimensional space Y, we see that it is enough to 

prove the result for n+ 1 when (a,) and (y^) satisfy Xi = yi for t = 0,1,..., n. 

Vectorialize X at scq. By 8.1.2.4 we have (i, | in + i) = (a:,-1 ^n+i) for t = 
1,..., n, so x„ + i and !/„+i belong to the same affine line D of X, orthogonal 
to Y. But loin + i = loj/n+ii SO 9.2.3 shows that d(a:„ 4 .i,y’) = d(y„ 4 .i,y’), 
whence in + j = y„+i or y„ + i = (Ty{x„+i). □ 



9.7.2. Remark. Proposition 9.7.1 expresses the uniqueness (up to isometry) 

of subsets (a:i),=o.i. k for which the distance d^j = XiXj are given. The 

existence of such sets will be discussed in 9.7.3.4. 

9.7.3. Relation between distances 

9.7.3.1. It is reasonable to suspect that the distances dij = z,'Zy between 

n +1 points (zi)»=o.i.n in a space X of dimension n —1 satisfy some universal 

relation. For n = 1 , since one of the three points in necessarily between the 
other two, the product (doj + d ()2 — dj 2 )(doi +di 2 ~do 2 )(d ()2 + di 2 ~<foi) must 
be zero. In arbitrary dimension n, the argument used in proposition 9.7.1 also 
leads to suspecting the existence of such a universal relation: taking the case 
n = 2 , for example, and giving Zo,®i ,®2 a-nd ^ 13 ,^ 23 , there are only two 
possible choices for Z 3 , symmetrically placed with respect to the line ( 11 , 12 )- 
We will in fact find this universal relation, by expressing the volume 

of the parallelepiped built on the points (a;i)i:=().i.„ iu n dimensions as a 

function of the distances dij = XiXj. If the points Zj actually belong to an 
(n— l)-dimeiisioiial subspace, the n-dimensional volume will be zero, and this 
will give the desired relation. 
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The volume is derived from 8.11.5 and 8.11.6, after vectorializing X at 
Xo and using again equality 8.1.2.4, in the form (ij |a:y) = ~ ^?>)- 

The desired relation is thus 


^1 




- <^n) 


<^tl 


= 0 . 


A rigorous proof would involve embedding AT in a space X' of dimension n 
and applying 8.11.8 (ii) and (v), for example, to the vectorialization of X' at 
Xq. 



The relation above has the drawback of not being symmetric with respect 
to the indices; the index 0 is special. To remedy this, we define the Cayley- 
Menger determinant of (sCijo.i. k ^ 




0 

1 

1 • 

1 



1 

0 

*^01 


9.7.3.2 

r(a:o,®i,.. •,*![) — 

1 


0 

■■ <^fc 



1 

^kO 


• 0 


9.7.3.3. Lemma. We have (cf. 8.11.5) 

_^ ^ 

Gram(a:oa:i, -^aoifc) = — ^ -r(aEo,a:i,..., ifc). 


Proof. A refinement of the argument used in 8.2.2 and 8.11.6. Take an ar¬ 
bitrary orthonormal frame, and denote by the coordinates of Xi in this 
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frame. Using standard operations on determinants, we obtain 


A(zo^i 1 • • •) ®fc®i) 




1 

1 

1 



*0 ■ 

Xq 

1 

0 


■ 

■ 

1 

0 


■ 

- 

1 

0 


0 • 

• 0 

0 

1 


The determinant of this last matrix D is then multiplied by the transpose of 
the determinant of the matrix obtained from D by switching its last two rows 
and its last two columns, giving 

{xo\xo) (a:o|a:i) •• (*0 | **) 1 

(xi \xo) (a:iia:i) • •• (a:i | Sfc) 1 

A(2oX| ,. .. , ) ~ ; I *. I i 

{xk\xo) (atfc|a:i) •• (** I **) 1 

1 1 •• 1 0 


where the “vectors” Xi are to be understood as the n-tuples of coordinates of 
Xi in the frame being used. We next replace (x, |a:y) by |(||xt|P + ||*y|p-«i?y). 
and eliminate all the ||xi|p by subtracting the appropriate multiples of the last 

column and line from the others. We finally obtain - —-jg—r(xo. Xi,..., Xn). 

□ 

9.7.8.4. Theorem. Let (xi)i=o,i.n 6e arbitrary points in an (n — l)-dimen- 

sional Euclidean affine space X. TVien r(xo,Xi,... ,a:„) = 0. A necessary and 

sufficient condition for (xi)i=o.i.n-i fe be a simplex of X is that r(xo,Xi, 

...,x„_i) ^ 0. Given k[k + l)/2 real numbers dij (t,y = 0,1,..., fc), a 

necessary and sufficient condition for the existence of a simplex (x,-)i=o,i.n 

satisfying di, = x^x j is that for every h = 2,...,k and every h-element 
subset of {0,1,..., fc} the corresponding Cayley-Menger determinant be non¬ 
zero and its sign be (—1)*'+^. 

Proof. The first two assertions follow from 8.11.6 and also 9.7.6.6. The last 
one is shown by induction on k, it being obvious for k = 1. By the induction 
assumption, we can construct in the (A: — 2)-dimensional space Z a simplex 

(x,)i =2 .fc ®uch that x,-xy = (t,y = 2,... ,A:). Next we find in the fc — 1- 

dimensional space Y containing Z two points x^ and xi satisfying Xq,Xi £ 
Y\Z, XiXi = dii and XoX,- = dot for all» = 2,...,k. Call h the projection of 
x[, onto Z; embed K as a hyperplane of a fc-dimensional space X and let IV 
be the codimension-two subspace of X that contains h and is orthogonal to 
Z. Consider a point x describing the circle C of center h in IV and passing 
through xf). This circle intersects Y at x^; as x describes C, the distance xxi 
takes on all values in the interval [xiXqjXiXq]. 
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Now consider the determinant in 9.7.3.2, with the distance replaced 
by an arbitrary real number it is clear that the determinant r(f) obtained is 
of the form r(f) = — f^r(a: 2 ,... ,**) + + with a, 6 R. We investigate 

the change in r(^) as ^ takes values in R.^; we know that r(^) is a quadratic 
trinomial whose term in has sign (—1)*'”^, by assumption. We also know, 
by the first part of the theorem, that r(f) vanishes for two distinct values 
= Xixly and = Xix"; it follows that r(f) has sign (—1)*“^ exactly for 
f 6 \xix[„xix"]. But r(<ioi) is non-zero, and its sign is (—1)*^+^ = (—1)*"^, 
by assumption; we conclude that there exists ato & C \ {xq,!"} such that 
zo2:i = doi- D 




Figure 9.7.3.4 


9.7.3.5. The reader may have remarked two facts in the proof of 9.7.3.4. The 
first is that not all hypotheses were used; in particular, if we know a priori 
that the z,- exist and form a simplex, every subset made up of these points is 
also a simplex. This leads to the suspicion that the algebraic properties of F 
are such that, if r(x(), x^,..., x„) is non-zero and has the right sign, the F of 
any subset will have the same properties. Such is indeed the case, cf. 9.14.23. 
Thus the necessary and sufficient condition in 9.7.3.4 could be weakened to 
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read that just the determinant F of the dij- [i,j = 0,..., k) has to be non-zero 
and have sign (— 

The second observation is that 9.7.3.4 does not answer the question posed 
in 9.7.2 in the case when the points do not form a simplex. The solution in 
this case is a bit longer, though not more difficult; it is stated in 9.14.23. 

The determinants F also give a criterion to determine if n-|-2 points of the 
n-dimensional space X lie on a sphere (cf. 9.7.5), and solve the related problem 
of determining the radius of the sphere circumscribed around a simplex. For 
this we introduce a new determinant A as follows: 


9.7.S.6 


A(a:i,...,a:fc) 


0 

0 

‘^2 




9.7.3.7. Proposition. If the points .„ form a simplex in an (n—1)- 

dimensional space X, the radius R of the sphere circumscribed around this 
simplex is given by 

p2 _ _ ^ ^(^11 • • • I ^n) _ 

2 F(a:,,...,a:„) ’ 

in particular, A(xi,..., x,,) 0 for a simplex. 

In order that n + 2 points (a:t)f=i.n +2 n-dimensional space X 

belong to the same sphere or the same hyperplane, it is necessary and sufficient 
that 

A(xi,...,x„+2) = 0. 


Proof. An easy calculation with determinants shows that, if dot = -R for 
t = 1,..., n, we have 

F(xq , X|,..., Xfi ) — 2R F(.X*! j..., Xfi ) A[x],..., Xn); 

the desired formula follows from taking xo as the center of the sphere circum¬ 
scribed around (xi)i=i.„, for then F(xo, xj,..., x„) = 0 by 9.7.3.4. 

The same method would also show that A(xx,..., x„+ 2 ) = 0 if the points 
X, are cospheric, but it wouldn’t work for the converse. For this reason we 
prove the equivalence of the two conditions directly. Choose an arbitrary or¬ 
thonormal frame, and let x^ (i = 1,..., n-t-2; j = 1,..., n) be the coordinates 
of the points x,-. If the points belong to the same hyperplane or sphere, there 
exists scalars a, b, cj, not all of which vanbh, satisfying 

n 

a||xi||^-t-6-|-^CixJ =0 





242 


Chapter 9. Euclidean afline spaces 


for all t = + 2 (cf. 10.7.6, for example). It follows that the two 

determinants below are zero: 


1 


Fr. + 2|! 


Fil 


^n + 2 

—2as{ 


= 0 , 


'n+2 

-2*7 


2®n + 2 


= 0 . 


1 ||®n + 2p ^n+2 

Thus the determinant of the matrix obtained by multiplying one of these by 
the transpose of the other is also zero; but this product is none other than 
A (z: I,..., Xri 4-2)1 

Conversely, if A(a:i,..., a:„ 42 ) = 0, the first determinant above is zero, 
and there must exist numbers a,b,Cj, not all zero, satisfying the condition 
above. By 10.7.6, the points Xi belong to the sphere or hyperplane given by 
the equation 


“II' IP+= 0 - 

t=i 

9 * 7 * 3 * 8 * Sxaxuplcs. Put a = dy2i ^ ~ ^23* ^ ~ We find 

r(a:i, 2212:3) = -(“ + f> + c)(a + 6 — c)(a - b + c)(-o + 6 + c); 

this is the area of a triangle of sides a, 6 ,c (see 10 . 3 . 3 ). 

Put Q = di2d34, P = <^13(^24, T = <iud 23 - We find 

A(a:i,Z 2 , a:3.a:4) = -(a ++'y)(a +- Tf)(a -+ Tf)(-a ++ qf). 


□ 


This is Ptolemy’s theorem (see 10.9.2). 

9.7.4. A FUNDAMENTAL PROBLEM ABOUT METRIC SPACES. The impor¬ 
tant role played by Euclidean affine spaces leads to the question of whether 
they can be characterized in metric terms alone. This fundamental problem 
was solved by K. Menger in 1928; Cayley-Menger determinants are the key to 
the solution, which is not significantly more difficult than the proof of 9.7.3.4. 
A complete solution is to be found in |BL, chapter IV]. 

The analogous problem can also be posed for other classical metric 
spaces, like spheres, elliptic and hyperbolic spaces (cf. chapters 18 and 19). 
The solution to this is also in [BL], which, by the way, is an excellent reference 
for the systematic study of metric spaces with no additional structure. See 
also [BER2|. 


9.7.5. Equidistant hyperplanes 

9.7.5.1. Proposition. Given distinct points x, y of X, the set {z S X \ 
zx = zy) is a hyperplane, said to be equidistant from x and y. (For n = 2, 
this set is also known as the perpendicular bisector of x and y.) 
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More generally, if (a:i)i=o,i,..i,fc are affine independent, the set 
{z S X \ ZXo = ZX\ = ■ ■ ■ = ZXk } 

is a subspace of dimension n + 1 — fc. In particular, if fc = n, that is, if (i,) is 
a simplex, there exists a unique point whose distance to all the vertices of the 
simplex is the same; in other words (cf. 10.7), there exists a unique sphere 
circumscribed around this simplex. 

Proof. Vectorialize X at Xo and observe that, as often before, the required 
relations between distances yield scalar products. Then apply 2.4.8. □ 

9.7.6. Appolonius’ formula, BARYCENTERS and distances. Using 
the notation and the nomenclature of 3.4.5, we recover a well-known result: 

9.7.6.1. Appolonius’formula. Let {(Aj.a:,)} 5e a finite family of punctual 
masses of X, and let (53,-A,-, g), or (0,1), be their barycenter. Then, for any 
z S X, we have 

i i ' I ' 

or 

51 + 2(iy I 

i i 

Proof. It suffices to write za? = zg^ -b gi? -f 2(zg | ^), and to use 3.4.6.5. □ 

9.7.6.2. Corollary. With the same data as in 9.7.6.1 and k a real number, 
put 

2/ = |z € Af A,- zxi = k 

Then 

i) if Ai = 0, 2/ is an affine hyperplane for every k, and is its 

direction; 

ii) if A,- > 0, we have L = if k < Yi 9^i i o.nd L = sphere of center 

1 

otherwise. □ 

The reader can supply the case Yi < 0- 

9.7.6.3. Corollary. With the same data as in 9.7.6.1 and the condition 

Yi ^ function z i—» Yi^i^^i ei<^hieves its minimum at g and 

nowhere else. □ 

9.7.6.4. Corollary 9.7.6.3 shows that the center of mass of a finite set of 
points is invariant under any isometry that leaves this set globally invariant; 
but this also follows, and more simply, from 3.7.3. and 9.1.3. 

9.7.6.5. Corollary 9.7.6.2 allows one to describe an impressive number of 
loci. We start by recovering 9.7.5.1, upon taking the family {(l,a:), (—1, j/)}. 


g and radius 
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More generally, the locus of points z such that the distance from z to two 
fixed points z, ^ is a constant A; ^ 1 is a sphere of center 

z _ fey 

1 — A: 1 — A: 

Similarly, the set {a 6 AT | + zy^ = A: } is a sphere, and {z S X \ 

zx^ — zy^ = A:} is a hyperplane; in the first case we have the “formula of the 
median”: 

zx^ + zy^ = 2 zg^ + 2 = 2 zg^ + - xy^. 



Finally, the set {z | zx^ — zy^ = A:} is a hyperplane, orthogonal to xy. 

9.7.6.6. Formula 9.7.6.1 gives a way of proving the first statement in 9.7.3.4 
without having to fiddle around with determinants (cf. 9.7.3.3). It goes like 
this; by 3.6, at least one of the points, say z„, can be written as a barycenter 
of the others, z„ = with This means the barycenter 

of the family 

{(^0i 2;o),..., (Ai— 1 , (~1, 2;,,)} 

is (0, 0)! By 9.7.6.1, the function does not depend on z; let k be 

its value. Substituting Zo, zi,..., z„ for z we get 

+ ^2(1?,2 H h ^u-ldo.n-l ~ ^ 


+ A2d^,2 + • • ■ + 

A(, + Ai + A2 +•■•+ A„_i 

whence r(z(), zi,. .., z„) = 0. 


= k 
1 = 0 , 
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9.8. Stabilizers of subsets 


9.8.1. In the study of a subset ^4 C AT, it is natural to introduce its stabilizer 
or isotropy group Isji(A^) = | g 6 IsfAT) j g{A) = A } in Is(X). The bigger 
Isji(X), the more “symmetries” A has; Isyi(Jir) is intimately associated with 
A. We have already met the subgroups Isyi(Jir) in chapter 1, and they will be 
essential in 12.5. The map x : A •—» Isyi(Jf) from the set of subsets of X into 
the set of subgroups of Is(Ar) is a unifying thread in this section. 

9.8.2. Remarks. If A is compact and g € Is(Jir), the condition g(A) C A 
implies g(A) = A, but this no longer holds if A is just closed or bounded 
(cf. 9.14.26). 

In general, Isyi(Ar) C Isj(X), where A is the closure of A, but the inclu¬ 
sion may be strict, cf. 9.14.27. 

With the metric induced from X, the subset A has a group of isometries 
Is(A), whence a restriction map p : Isyi(X) —» Is(A). In general, p is not 
injective; for instance, if V = A is a proper subspace of X, the reflection 
try induces the identity on X. Injectivity holds if (A) = X, where (A) is 
the affine subspace generated by the set A (see 2.4.2.5). Surjectivity always 
holds, by 9.7.1. 

The map y itself is neither surjective nor injective. The two sets in figure 

9.8.2. for example, have the same group of symmetries; on the other hand, 
given w € Jf, the subgroup G = Is^ (Jf) cannot be the stabilizer of any subset 
A, for any orbit of G is the union of spheres of center w, and thus must be 
invariant under ISij(Jf). However, if (? is a finite subgroup of Is(Jf), there 
always exists A CL X such that G = Isj 4 (Jf) (see 9.14.36). 



9.8.3. Closedness. Unlike Is„(X), where o 6 X is a point, Isji(X) is not 
closed in Is(X): just take A to be an orbit of G, where G is the subgroup of 
Is(X) generated by an irrational plane rotation. But if A is closed in X, so 
is Isji(X) in Is(X). 

9.8.4. Boundedness. If A is bounded, so is Isyi(X). For, as we shall see 
below, Isj(X) is bounded, and Isj 4 (jf) C Is;;;j(Jf) (cf. 9.8.2). On the other 
hand, it may happen that ISj 4 (X) is bounded but A is not: see 9.14.27. 
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9.8.5. Finiteness. It may happen that l6ji(X) is finite and A is not (see 
9.14.27), and that A is finite but ISj 4 (X) is not—for example, when A is 
contained in a subspace of codimension > 2. On the other hand, if dim(A) > 
dimX — 1, the finiteness of A implies that of Isji(A^). This is clear, since a 
map g 6 Is(X) that induces the identity on A also induces the identity on 
(A); thus, if (A) = X we must have g = Idxi and if (A) is a hyperplane we 
have g = Idx or a'(yi)- 

9.8.6. Compactness and fixed points. 

9.8.6.1..- Proposition. If A is a compact subset of X, there exists x € X 
such that Isx(X) C Is,i(Ar). In particular, Isx(X) is compact in Is(X). If 
G C Is(X) is compact, there exists x€ X such that G C ISa;(Ar). 

Proof. The second part follows from the first, by considering any orbit of 
G. The “in particular” follows from 9.8.3 and the compactness of Isa;(X) 
(cf. 8.2.3.3). The first assertion can be shown in at least three ways; 

9.8.6.2. First proof. This is the same as the proof of 2.7.5.9; here the 
condition that the interior be non-empty is not necessary, since Isj(X) is 
always compact, contrary to GA*(Ar) = GL(Ar). 

9.8.6.3. Second proof. This follows from 11.5.8. In fact, denoting by B the 
ball of least radius in X containing A, we have g(B) = B whenever g(A) = A, 
since B is unique and is defined in purely metrical terms. The desired fixed 
point X is the center of B. 

9. 8.6.4. Third proof (Bruhat-Tits lemma). This lemma guarantees the 
existence of a fixed point for the group of isometries of a compact subset of 
any metric space in a certain class. This class encompasses Euclidean affine 
spaces and many others, including hyperbolic spaces, cf. 19.4.7. 

Observe that 9.8.6.1 cannot hold for an arbitrary metric space; for ex¬ 
ample, the isometry group Is(5) of the sphere 5 centered at x does not leave 
any point of 5 fixed, and yet is compact, Is(5) = Isj(X). 

The class of metric spaces X under advisement is characterized by the 
following property: 

(CN) For every x,y S X there exists m€ X such that 

d^{x,z) + <f(y,z) > 2(P(m,z) + y) 

for all z € X. 

To see that this condition is satisfied by Euclidean affine spaces, take m to 
be the midpoint of x and y; the formula of the median (9.7.6.5) shows that 
equality holds in (CN). The midpoint of a segment also works in hyperbolic 
space (19.4.7). On the other hand, spheres do not satisfy (CN). 

9.8.6.5. Lemma (Bruhat-Tits) (cf. |BR-TI, 63]). Let X be a complete 
metric space satisfying (CN), and let A be a bounded subset of X. There 
exists a point x such that, for every g 6 Is(Jf) satisfying g(A) = A, we have 
g(x) = X. 
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Proof. First, an idea of the proof, assuming A compact. Consider the set fi(A) 
of points m obtained by applying (CN) to pairs of points x,y S A satisfying 
— diam(A). Condition (CN) implies that diam(/i(A)) < Adiam(A), 
where A < 1 is a universal constant; we thus obtain a decreasing sequence of 
compact sets whose diameter approaches zero, and the intersection of these 
sets is the desired point. 

Now for a detailed proof in the general case where A is a bounded subset 
of X. Fix k € (0,1] and, for wery Y d X, denote by /i(K) the set of points m 
obtained by applying (CN) to pairs x,y 6 Y such that d(x,y) > fcdiam(K). 
Condition (CN) gives 

sup { d(x, y) I X e y, y e ^(K) } < ki diam(y), 
diam(^(y’)) < diam(y’), 

where ki = y/l — k'^/A and k -2 = — k'^/2. Define, by induction, 

/x"(y)-/x{^"-*(y)). 

Let A be bounded; from the inequalities above, we deduce that 
diam(/x"(A)) < diam(j4) 

for all n 6 N*, so the intersection of the fi"(A) has at most one point. 
Let (z„)„eN be an arbitrary sequence of points z„ 6 fi"(A), n 6 N. By 
the preceding inequalities, d(x„,z„ 4 i) < kik” diam(A), so (x„) is a Cauchy 
sequence, and converges to a point x by completeness. Thus HneN /^"(•^) — 
{x}. Since'the sets fi"(A) are obtained from A by a purely metric procedure, 
they must be invariant under Isj 4 (X), and the same holds for x. □ 

9.8.7. Remark. As in 2.7.5.11, one sees that all maximal compact sub¬ 
groups of Is(X) are conjugate. 

9.8.8. Examples. For examples, see the proof of 1.7.5.1 and [BR-TI, 64]. 
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9.9. Length of curves 


In this section M denotes an arbitrary metric space, whose distance is 
denoted by d(-, •). 

9.9.1. Definitions. A curve or path in M is a pair ([o,fc],/) (o < b) 
consisting of a closed interval o/R and a continuous map f : [a, b] —* M. The 
endpoints of ([o,6],/) are the points {/(a),/(6)); they are said to be joined 
by f. (Sometimes f(a) is called the starting point and f(b) the endpoint). 
The set of curves joining f(a) to f[b) is denoted by C{x,y). The length of 
f is the scalar leng(/) 6 R+ U oo given by 


sup 


to < ti <■■■< tn = b and n 6 N* 


The curve f is said to be rectiSable t/leng(/) 6 R+. 

9.9.2. Remarks. Saying that C(x,y) ^ 0 for every x,y S M is the same 
as saying that M is path-connected. If 6 ; [c, dj —> is a homeomorphism 

and / : [a, i)| —♦ M is a curve in M, we have leng(/ o = leng(/). FVom 
the point of view of the length, we can then take the quotient of the set of 
curves by the equivalence relation thus obtained. The quotient classes are 
called geometric arcs; this notion shall not be needed in this book. 

Observe that a drawing like 9.9.2 assumes the existence of “segments" 
connecting each pair of consecutive points (/(ft))/(ft-n)); we shall discuss 
this matter in 9.9.4.2. 



9.9.3. Examples. 

9.9.3.1. One necessarily has leng(/) > d{x,y) for every / 6 C(x,y). 

9.9.5.2. U f : \a, b] —* M and g : \b, c] —> M are curves such that f(b) = g(b), 
the curve f U g : [a, c] —> M satisfies 

leng(/ U ff) = leng(/) + leng(ff). 

9.9.3.3. Even in good spaces M, there exist non-rectifiable curves. For 
example, one can construct a curve in the plane R^ by iterating infinitely 
often the procedure suggested by figure 9.9.3.3. (The reader should verify 
that the map thus obtained is continuous.) The length must be > (4/3)" for 
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every n 6 N*, so it cannot be finite. Observe also that, for any t < t' S [0, l], 
the length of the restriction /||t.t'] is still infinite. 

In spite of their pathological appearance, such curves (called “fractals”) 
occur very naturally; as coastlines, for example (cf. |MB]). 

9.9.4. Segments. Intrinsic and excellent spaces 

9.9.4.1. Definition. A curve f ; [a, fcj —» M is called a segment if d{f(t), 
f(t')) = t' — t for every t < t' € [a, fc]. In particular, leng(/) = d[f(a), f(b)). 

9.9.4.2. No conflict arises with the notion, introduced in 3.4.3, of the seg¬ 
ment [z, y] with endpoints x,y in a. Euclidean affine space X. Indeed, such a 
segment determines a segment in the new sense, unique up to a translation 
of its interval of definition: 

[a, a + d(x, y)| 9 1 1 -+ f(t) = z -f € X. 

d[x, y) 

f f(r) f(v) ^ 

Figure 9.9.4.2 

In general, even if C(x,y) ^ 0, there may not be a segment joining x 
and y. There are reasonably general conditions that guarantee the existence 
of segments; see [CT, 135] or [BL, 70]. 

9.9.4.3. Counterexamples are easily manufactured; take M = \(0,0) 

with the metric induced from R^. There is no segment joining x and —x, as 
such a segment should have length 2||z||, and the only path in R^ with this 
length is the segment [z, —zj, which unfortunately contains (0,0) ^ M. 

Another example is the sphere S” C R"'*'*, with the metric induced 
from R"+*. For any distinct points z, y € 5", and any curve / joining z and 
y, we have leng(/) > d(x, y), because 

leng(/) > d{x,f(t)) +d{f(t),y) 
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Figure 9.9.4.3 


for any t 6 ]a, fc[, and 

/(O) + «^(/(<)i y) > y) 

by 9.1.1.1. 

Such counterexamples lead naturally to the following 

9.9.4.4. Definition. A metric space M is called intrinsic if it is path- 
connected and d(x,y) = inf { leng(/) j / € (7(2:, y) } for every x,y € X. It is 
called excellent if, for any x,y & X, there exists a segment joining x and y. 

9.9.4.5. Examples. By 9.9.4.1, excellent spaces are always intrinsic. The 
converse is not true, as can be seen from the first example in 9.9.4.3. 

Euclidean affine spaces are excellent, by 9.4.4.2. Moreover, 9.1.1.1 shows 
that a segment joining two given points is unique (up to a translation of the 
interval of definition). This is not true for other excellent spaces, as can be 
seen from the example of the circle (9.9.8). 

We shall meet other excellent spaces in 18.4.2, 19.1.2 and 19.3.2. The 
first of these examples deals with the sphere, which, as seen in 9.9.4.3, is not 
an intrinsic space with the metric induced from R"'*'*; this flaw is remedied 
in 18.4. 


9.9.5. Shortest path. One often hears that, in a Euclidean space, the 
shortest path between two points is the segment joining them. More precisely, 
if / : [a, 6] —♦ X is a curve joining x and y and having length leng(/) = d(x, y), 
we have, for every i < t' 6 |a,6[: 

/(i')elz,y| and /(() e [z,/((')]. 


This, too, is a consequence of 9.1.1.1 and the following elementary remark: 
if leng(/) = d(x,y), then, for every t e [o,fc], leng(/|[„.t]) = d{x,f(t)) and 
leng(/|[t.h]) = d{f{t),y). 


9.9.6. Remarks. Let X be a Euclidean affine space. Recall (cf., for exam¬ 
ple, |DR, 314]) that if a curve / : |o,6] —♦ X is of class (7*, it is rectifiable 
and has length 


leng(/) = 
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An analogous formula holds for curves in differentiable submanifolds of Eu¬ 
clidean affine spaces, and even in abstract riemannian manifolds. If the man¬ 
ifold is complete, the metric obtained is excellent, and uniqueness of seg¬ 
ments holds locally. These considerations lead to the fundamental notion of 
geodesics: see [BERl], [KGl, 144 ff.) or [MA, 271], for example, for these 
generalizations. 

9.9.7. Systematic construction of an intrinsic metric. If (M,d) 
is a non-intrinsic metric space, we can almost always derive from it an intrinsic 
metric d via the following natural procedure: 

9.9.7.1. Proposition. Let (M,d) be a metric space such that, for every 
x,y S X, there exists f 6 C[x,y) such that leng(/) < oo. Then 

d : M X M B {x,y) d(x, y) = inf { leng{/) | / e <7(1, y) } 6 R+ 
is an intrinsic metric on M. Furthermore, d = d. 

Proof. Left to the reader (see 9.14.30). □ 

The fact that d = d means that the procedure stops after one step. 

In the case of (5”, d), where d is the metric induced from R”'*'*, the new 
metric d obtained is none other than d(x,y) — Arccos((a: j y)), introduced 
in 8.6.3. We study the case n = 1 in 9.9.8 and the general case in 18.4 
(see 18.4 .3). C omputing d with the help of 9.9.6 and the parametrization 
t t-* (f, V^l — t^) of the circle S*, we recover the measure of angles obtained 
/■* ds 

from the integral / - - this process is somewhat more elementary than 

JO vl — 

the one employing complex exponentials (8.3.13). 

9.9.8. The intrinsic metric of the circle. Consider the circle C = 
{ 2 6 I llzjl = 1 } in the Euclidean vector plane X. This can be identified 
with the set of half-lines of X (cf. 8.6.1). 

9.9.8.1. Theorem. With the metric sy = Arccos((z j y)), the circle C is an 
intrinsic metric space. The relation zy-l-yz = zz holds if and only if the half- 
line R+y lies between R+z and R+z (cf. 8.7.5.2). Given x,y € C, x^ —y, 
the unique shortest path from x to y (cf. 9.9.5) is the arc of circle from x to 
y (cf. 8.7.5.4); if x = ~y, there are two shortest paths, the semicircles with 
endpoints x and y. 



Figure 9.9.8.I.1 
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Proof. Consider three points x,y,z 6 C. Leaving aside the cases when two 
of them are equal or opposite, which are treated directly, we are left with the 
four possibilities shown in figure 9.9.8.1. 1 . Prom 8 .7.5.3, we see that in case I 
we have xy+yz = xz; in case II, xy + yz+sz = 2 ir; in case III, xz = yz —xy; 
and in case IV, xz = xy —yz. Bearing in mind that'xy+yz > ir in case II, we 
do obtain xy + yz > xz in all cases, and equality only in case I. In particular, 
((7,“) is a metric space. 

To see that this metric is intrinsic, we show that it is excellent. Let 
x,y 6 (7. Following the construction in 8.7.5, let xy = t, orient X and join x 
to y by the curve 

/ ; [ 0 , <1 3 s I-+ cos s • ei + sin s ■ 62 6 (7. 

For every s, s' 6 [0, with s < s', we have f[s)f[s') = s' — s, so / is really 
a segment. As for shortest paths, we just have to consider the strict triangle 
inequality as given in the first part of the theorem, and the remark at the 
end of 9.9.5. 



Figure 9.9,8.1.2 


9.9.9. Note. For results on curves in general metric spaces, and in par¬ 
ticular for a definition of curvature and torsion that does not use differential 
calculus, see [BL, 74 ff.) or [B-M, chapter 10); see also 9.14.31. 

9.10. Distance and differential geometry. The first 
variation formula 

This imposing-sounding formula merely gives the derivative d' of the 
distance function d : X x X —^Rona Euclidean affine space. In spite of its 
simplicity, it has numerous nice consequences. See also 9.10.7. 

9.10.1. The first variation formula. For every pair of distinct points 
x,y S X and every u,v & X, we have 

d'(x,y)(u,v) = —^(iy I w - u) = ||t7|| cos(i7, xy) - ||u|| cos(u, xy). 

Hoi'll 

where the angles are between oriented lines (cf. 8.6.3). 

Proof. Let e be the square of d, so that 

e(x,y) = <f[x,y) = ||xy||^. 
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X 


Figure 9.10.1 

From the rules of differential calculus, 

e'(x,y)(il,v) = 2[xy\v- u), 

since the scalar product is bilinear (see (CHl, 33], for example). Again by 
calculus, the derivative of d = y/e is given by the desired formula. □ 

9.10.2. For example, if two curves C and C are everywhere orthogonal to a 
family D(t) of lines, we must have d(Cn£)(t), C"n£)(l)) = constant, because 
the two angles in 9.10.1 are equal to 7r/2 and their cosine is zero. In particular, 
the tangent at m to a curve drawn on a sphere of center x is orthogonal to 
the line (x,m) (compare 10.7.4). 



9.10.3. Let x(t) be a point describing a curve C of class C^, assumed to 

be regular (i.e., 2 :'(f) / 0 for all t). Let be the tangent to C at x(t), 

and y(t) a point of that describes a curve D everywhere orthogonal 

to Tx^^)C (this is the case, for example, if C is the evoJute of D, that is, the 
envelope of the normals to D). Under these conditions we have, for all values 
of s,t: 

|x(f)y(f) - 2 (s)y(s)| = leng(C'||,.t|). 

To see this, take u = x'(t) and v = y'(t) in 9.10.1. The angle between v and 
xy will be 5r/2 and have cosine zero, whereas the angle between u and xy will 
be 0 or ir. Thus d'(z(f), !/(f))(x'(f), y'(f)) = i:||x'(<)||, from which we obtain 
the desired formula by integrating from s to < and using 9.9.6. 

9.10.4. A similar but somewhat more complicated example, to be used in 

17.6.4. is the following: two points x(t), y(t) describe the same curve C, 
assumed regular. The tangents Ti(t)(7, Ty^t^C intersect at the point z(t) — 
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n (TyitjC), which describes a curve D such that the tangent 
is the exterior bisector of the oriented lines z(t)x(t) and z(t)y(t). Then 
^(0 = 2 :(<) 2 (<) + y{t)z{t) - leng(C|fron, i(i| to vui) = constant. 

To see this, differentiate this function using 9.9.6 and 9.10.1; 

r(t) = ||2>(0||cos(^(0 ;(0,2>(0)-| x'(0|| 

+ lk'( 0 || cos(y(<)2(0.*'(0) - l|s''( 0 ll - Ik'IOll + lk'( 0 ||- 

The two terms involving || 2 '(t)|| vanish because is assumed to be the 

exterior bisector. 

9.10.5. Formula 9.10.1 could have been used to guess 9.4.1.1. In fact, if 
F = ax + bx has a miniinum at x, we have, for any vector ^ & D: 

+cos(f, xb)) = 0 , 

so D must be the exterior bisector of xa and xb (figure 9.10.5). 

9.10.6. Formula 9.10.1 can be used to guess the solution to the following 
problem of Fermat: given a triangle {a,b,c} in the Euclidean plane, find a 
point X that minimizes ax + 62 ; + cx. If x achieves this minimum and is not 
a vertex, we have, for every u £ Jf: 



In other words, the three unit vectors in the direction of ax, bx and cx must 
add up to zero, and this is the case if and only if their angles at x are all 
equal to 2n'/3. This is the starting point for the solution developed in 10.4.3. 
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9.10,7. Note. The name “first variation formula” signifies the fact that we 
are dealing with the first derivative d'. The computation of d" does not result 
in anything new that cannot be done directly (roughly speaking, 9.2.2). For 
submanifolds of Euclidean spaces or abstract Riemannian manifolds, on the 
other hand, the “second variation formula”, giving d", is a fundamental tool. 
It is from this formula that most global results in Riemannian geometry are 
derived, and it is in the second derivative that one finds the curvature of the 
manifold (which is zero in the Euclidean case). For recent references, see, for 
example, [KO-NO, volume 2, chapter VIII] and |G-K-M, 121 ff.j. 


9.11. The HausdorfF metric 

For later use, especially in chapter 12, where it will be essential, we 
define a metric structure on the set of all compact subsets of a Euclidean 
affine space. This metric is due to Hausdorff; we start by considering it in a 
slightly more general setting. 

9.11.1. Notation. Let X be a metric space. Denote by K = K{X) the set 
of all compact sets of X. If F is a subset of X and p > 0 is a reed number, 
put (cf. 0.3) 

U(F,p)^{xeX\d{x,F)<p}, B(F.p) = {xeX\d(x,F)<p}. 

If F,G are subsets of X, define the Hausdorff distance between F and G as 
the real number 

6(F, G) = inf { p I E c S(G, p) and G c B(F, p) }. 

Watch out for the difference between 6(F,G) and d(F,G) (cf. 0.3). 



256 


Chapter 9. Euclidean affine spaces 


F=[-*] 

G=:BiF,2“'”5! 
~ '=H\G 



9.11.2. Theorem. The Hausdorff distance makes /C(X) into a metric space. 
If X is such that every closed bounded set is compact, then (/C(AT), (5) is 
complete (resp. compact) if X is. 

Proof. The fun^ion <5( , ) is obviously symmetric. If S(F,G) = 0, we have 
F c B(G, 0) = G = G and G c F, whence F = G. Let F, G, H be arbitrary 
subsets of X satisfying G C B(H,cr) and F C S(G, p); then F C B(G,p) C 
B(H, a p). Thus, if S(F,G) = p and S(G, H) = a, we have 6 (F,H) < p + a. 

FVom now on it is understood that K(X), or just K, is endowed with the 
Hausdorif metric. Let (/’njneN be a Cauchy sequence in K. For each n G N, 

set _ 

Gn — Fn + p. 

peN 

Each G„ is bounded because, since [F„) is a Cauchy sequence, there exists no 
such that S(F„, F„i,) < 1 for all n > no, and F„ C B(Fn„, l) for n > no. We 
thus have a decreasing sequence (G„) of bounded and closed, hence compact, 
sets; by a classical result of general topology (cf. 0.4), the intersection F = 
n„6N^n is non-empty. The set F is in /C; there remains to see that F = 
lim„_oo F„ in K. Given e > 0, let no be such that i5(F„, Fn^) for all n > no. 
Then F C G„„ C B(F„„,e). In the opposite sense, there exists ni such that 
G„ C B(F, e) for all n > n^; taking n > sup (no, n^) gives 6(F, F„) < e. 

To show that K is compact when X is, recall that compactness is equiv¬ 
alent to completeness and the open cover property (for every e > 0 the space 
has a finite cover by sets of diameter < e). So assume X is compact and 
take a finite cover of X by balls B(xi,e). Denote by Ko the set of subsets 
of { 2 : 1 ,... , 2 , 1 }; we have Ko C K. We show that K = e); 
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this is enough since #/Co = 2” < oo. For G G K arbitrary, introduce the 
set F = { Zj I d(xi,G) < e }• By construction, F C B(G, e); but since X — 
IJiLi B(xi, e), there exists for any j/ G G an such that y G B(xi,t), showing 
that z, G F and G C B(F,e). Thus 6(F,G) < e, or again G C Bn(F,i). 



9.11.3. Corollary. If X is compact, finite subsets form a dense subset of 
K{X). 

Proof. This follows from the last pzirt of the proof of 9.11.2. □ 

9.11.4. Corollary. Let X be a Euclidean affine space. Then K(X) is 
complete. Moreover, for any a € X and r G R+, the set KaT(X) = { F G 

|Fc J3(a,r)} is compact. 

Proof. This result is often called the “Blaschke selection theorem” (|EN, 64]); 
it implies that every infinite family of sets /Ca.r(X) has an accumulation point, 
or, in other words, one can pick compacts subsets of X with certain limit 
properties. We shall use this in 9.13.8 and 12.11.1. 

9.11.5. Proposition. For every metric space X, the diameter function 
diam : K(X) •—> R (cf. 0.3) is a Lipschitz map with constant 2. 

Proof. Let F, G G /C be such that S(F,G) = e, and take x,y €: F satisfying 
d(x,y) = diam(F). There exist z,t €G such that d[x,z) < e and d(y,t) < e; 
this implies 

diam(F) = d(x, y) < d(x,z) + d[z,t) + d(t, y) < 2e + diam(G). 
Switching F and G gives |diam(F) — diam(G)| < 2e. □ 

9.11.6. Proposition. Let X be a Euclidean affine space and H a hyper¬ 
plane in X. Then p : K(X) ^ K >—* p(K) G K(H), the orthogonal projection 
on H (cf. 9.2.4), is Lipschitz with constant 1. 

Proof. It is enough to observe that, for any K & K (X) and any real number 
p, we have 


p{Bx(K,p)) = B!,{p(K),p). 


□ 
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9.11.7. FVom the triangle inequality we deduce that, for every p,a, F and G, 
6{B(F,p),B[G,a))<6[F,G) + \p-a\. 

9.12. Canonical measure and volume 


9.12.1. Here we combine 2.7.4 and 8.11. In 8.11 we saw that a Euclidean 
vector space X possesses a canonical Lebesgue measure, obtained, for exam¬ 
ple, as the image of the Lebesgue measure of R” under any vector isometry 
R" —* X. By 2.7.4.3, the canonical measure on X gives rise to a canonical 
measure on X, which we denote by fi or fix if necessary. By construction, if 

{x^},_o 1 .„ is an orthonormal frame for X and / : —> R is integrable, we 

have 

/ /m= / /^. 

JX JR’' 

where /to is the Lebesgue measure on R” and / abo denotes, by abuse of 
notation, the function on R" obtained from / via the isomorphbm X -+ R" 
associated with the chosen frame. Naturally, fi = fii, when X = R". Clearly 
p, is invariant under ls(X), and is the unique measure (up to a scalar) with 
this property (cf. 2.7.4.4). 

9.12.2. An explicit link with the volume form and the density 6^ b 
provided by the theory of forms and densities on differentiable manifolds. 
The manifold is X, the form and the density, denoted by Ax and 6x, are ob¬ 
tained directly from A^ and because the tangent space to X b canonically 
identified with X, and we have, for every function / : Af —♦ R: 

f f f^x= f fSx. 

Jx JX JX 

9.12.3. Examples. In X, every affine subspace Y X has measure zero. 

If X and X' are two Euclidean affine spaces of same dimension n and 

f : X X' is a. similarity of ratio k (cf. 9.5), the image measure f(px) is 
equal to k’‘px- 



9.12. Canonical measure and volume 


259 



Figure 9.12.3 


Let Y and Z be complementary orthogonal affine subspaces of X. The 
measure fix is the product of the measures fiy and that is, fix = 
and we can apply Fubini’s theorem (cf. 0.6). 

9.12.4. Volume 

9.12.4.1. Definition. The voJume of a compact set K of X, denoted by 
il(ff), is the integral f^XKP'i aihere XK ** ib.e characteristic function of 
K (cf. 0.6). In dimension one (resp. two) we generally talk about length 
(resp. area.) instead of volume. 

These are really our everyday concepts of volume and area. Here we compute 
only the volume of a box and a simplex; for the volume of common solids, 
like balls, the reader can consult 9.12.4.7, 9.12.4.8 and 12.12.10. 

9.12.4.2. The box built on the points {zt}i=: 0 .i. n is the set 


^ = { 2:0 + ^ A, e [0,1] for t = 1,..., n |. 

The solid simplex built on the same points is the set 

“ I ~ ^ At > 0 for i = 0,1,..., n >. 

^ is=o I 


1=0 


We have the formulas 

9 . 12 . 4.3 jC(P) = S^(xoxl,...,XoXn), jC{S) = 

The first formula follows from 9.12.3 if {*, } is not an affine frame; otherwise 
{z,} determines a linear isomorphism f : X —* R” (which is generally not an 
isometry), and the image measure f[ft) is equal to 6^(xqXi, .,. by 

2.7.4.3 and 8.11. Now f(P) is the unit cube in R”, and its volume is 1 (by 
Fubini, for example); this gives the formula for the box. 

The second formula is obtained by induction, using the intermediate 
formula 9.12.4.4. (We can assume {z,} is an affine frame, otherwise £(5) = 
Il{P) = 0.) Let r) be the distance 

n = d(2:o,(z,,...,z„)) 

from zq to the liyperplaiie Y — (zi,...,z„), and cr the volume of the solid 
simplex S' built on the points {z, },= i.„ of the (n—1-dimensional) Euclidean 





260 


Chapter 9. Euclidean affine spaces 


space H. Then 


9.12.4.4 


Il{S) = -ria. 
n 


For n = 2 , this says that the area of a triangle is equal to one half the altitude 
times the base, and for n = 3, that the volume of a tetrahedron is one third 
of the height times the area of the opposite face. 




To prove 9.12.4.4, let D be the line orthogonal to Y and containing xo- 
We can parametrize £) by a real number t so that D is isometric to R and so 
that t = rj at D n F and t = 0 at xo- By 9.12.3, we have 

£(5)= f\(t)dt, 

Jo 

where <T(t) denotes the volume (in the Euclidean space of 

solid simplex ,(■?*) obtained from S' by a homothety of center X(, and 

radius t/rj. But this volume is equal to 9-12.3; we obtain 



9.12.4.5. It may seem wasteful to use integration theory to calculate the 
volume of a box or a simplex; a more elementary theory is developed in 

12.2.5. 

One may also want to calculate C(S) as a function of the lengths d^j = 
2 , 2 j of the simplex S only; this follows immediately from combining 8.11.6, 
9.7.3.2 and 9.12.4.3. 

9.12.4.6. Volume of balls. We denote by B„(o,r) the ball of center o and 
radius r in the n-dimensional space X, for any a €: X, r €: R 4 .. Then 


iI(B 2 ,i+i(a, r)) 


. 2 .£ 


d\ 


2.£+i^.i 


, 2 ,i+ 1 


13 5 


(2d+ 1) 
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By 9.12.3, it is enough to find 1)) in the standard space R". This 

is a classical computation and can be carried out in various ways (cf. [B-G, 

6.5.7 and 6.10.9]). In the sequel we shaU often need the special value 

9.12.4.7 ^(n) = £(B„(0,l)). 


9.12.4.8. Since we’re at it, we give the (n — l)-dimensional volume of the 
sphere 5„(0,1) (cf. 9.12.7, 12.10.8, [B-G, 6.5.6.1|). Let Q(n) be the (n - 1)- 
dimensional volume of the unit sphere in R"; we have a(n) = nfi(n) 

for every n. Thus 


a(2d) = 


a(2d+ 1) = 


2if^ 

Jd^.' 

2d+l^d 

13.(2d - 1)' 


9.12.4.9. Orthogonal projection. Let H and H' be hyperplanes of X, 
and let a G [0,7r/2| be the angle between them: a = Denoting 

hy p : H' H the restriction to H' of the orthogonal projection from X onto 
H, by £// (resp. £.h‘) the volume in H (resp. H') and by K an arbitrary 
compact set in H', we have 

Ch{p(K)) = cosQ £.hi(K). 


To see this, take an orthonormal frame in H' that has n — 1 vectors in Hr\H'. 


9.12.5. Volume and HaUSDORFF distance. Even for compact sets, it 
is not to be expected that the volume function i! : IC —+ R is a simple one. 
For example, there exist compact sets K c R^ whose boundary dK not only 
has positive area, but has positive area everywhere (see |G-0, 135]). The 
function il : IC —* R is certainly not continuous, otherwise (cf. 9.11.3) all 
compact sets would have volume zero, because finite sets do. But we do have 
the following 

9.12.5.1. Proposition. K R is upper semicontinuous. 

Proof. Assume that F = limn_oo in /C; we show that limsup„_oo xf„ < 
Xf- Let X e X\F-, there exists no such that 6(F, F„)< d(x, F) for all n > no. 
Thus XF„(2:) = 0 for all n > no- □ 



Figure 9.12.5.1 
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9.12.5.2. Note. We shall see in 12.9.3.4 that i! is continuous when restricted 
to convex compact sets. 

9.12.6. Centroids. Let K he a. compact set in a Euclidean affine space X. 
Assume fc ^ and let g = cent'(if) (cf. 2.7.5.2). A generalization of 9.7.6.1 
with all the Aj — 1 is the following; for every x G X, 

9.12.6.1 j XKxa^ti=f xk go.^tJ'+l(K)xg^. 

Jnex Ja€X 

Here again the proof consists in writing 

xa^ = xg^ + 2(xg \ ga) + ga^. 

To generalize 9.7.6.1 with arbitrary A,- > 0, we choose on K an arbitrary 
positive measure 6. If the total mass of if is positive, we can define the 
barycenter of K for the measure as the vector 



which does not depend on x. We then have, for every x & X, 

9.12.6.2 f xa^e:=l ga^B + ([ 

J,teK Ja€K \JaeK ) 

9.12.6.3. Notes. Such measures 6 are naturally encountered when dealing 
with curves in the Euclidean plane or surfaces in Euclidean three-space, for 
example. One uses their canonical measure, cf. |B-G, 6.4]. 

Formulas 9.12.6.1 and 9.12.6.2 show that the function 



has a minimum at g, and only there. In mechanical terms this means that the 
moment of inertia around the point g is the smallest possible, so if you want 
to spin something around a point with the least possible effort, you should 
do it around g. This is clearly useful to know in practice! 

For a nice relation between volume and center of mass, see 12.12.20.9. 
For mechanical methods for the calculation of areas, see [GK, 72 ff.j. 

9.12.7. The fc -dimensional volume. If C is a differentiable curve in 
the plane its volume C(C) is zero, and similarly for, say, the sphere 5^ in 
R^. This concept of volume is not much good for such objects. For C the 
appropriate notion would be its length, defined in 9.9 (at least when C is 
a homeomorphic image of its interval of definition), and for 5^ it would be 
its area, equal to Ax (cf. 18.3.7, for example). In general, we would like to 
characterize “fc-dimensional subsets” of X, and define for such subsets their 
“fc-dimensional volume” (area if fc = 2, length if fc = 1). This is a very difficult 
problem in general, though certainly natural (think of paintiug a solid; for an 
application of this idea, see 12.10.7). 
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If we restrict ourselves to submanifolds of X, no difficulty arises: such 
a submanifold possesses a canonical measure, so if it is compact it has a total 
mass, and that is its A:-volume (k being the dimension of the manifold). See, 
for example, [B-G, 6.4). On the other hand, if we want to break free from 
this very restrictive condition, which precludes even simple bodies as in figure 
9.12.7.1, and allow certain kinds of singularity, no one notion of A:-dimensional 
volume (2 < fc < n = dim AT) will do. There are, in fact, many notions, all 
essentially different. For a complete discussion of this question, see |FR], in 
particular pages 171-174, where no less than seven fc-dimensional measures 
are introduced. For k = n = dim X, all these measures coincide with the 
volume, and for compact differentiable manifolds they all coincide with the 
notion discussed above. 



Figure 9.12.7.1 


There is, however, one particular case that we discuss in detail, namely, 
frontiers of compact convex sets. These possess a canonical (n—l)-dimensional 
volume, which we call area for the sake of simplicity. We first define it for 
polytopes (12.3), then approximate convex sets by polytopes (12.10). See 
also the case of the sphere in 18.3.7. 

The reader can get a feeling for the pitfalls involved in this process by 
considering the well-known example of the Chinese lantern, where we try to 



Figure 9.12.7.2 (source: [B-G]) 
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find the area of a cylinder by approximating it by polyhedra, by analogy with 
the one-dimensional case (figure 9.12.7.2). Unfortunately, depending on the 
ratio between the number of horizontal slices and the number of sides of the 
regular polygons as both go to infinity, the surface area of the polyhedra can 
be made to approach any limit between the area of the cylinder (the limit we 
really want) and infinity. 

9.13. Steiner symmetrization 

This operation, which transforms a compact subset of X into another, 
plays a fundamental role in the proof of certain inequalities about compact 
sets: see 9.13.8, but especially the conclusion of chapter 12. Apart from its 
inherent beauty, we study its properties here with an eye to applications in 
chapter 12. 

Let X be an affine space, and fix a hyperplane H of X. Recall that ajj 
stands for the reflection through X (cf. 9.2.4). Let be a compact subset 

of a:. 

9.13.1. Definition. The Steiner symmetTiz^tion of K with respect to H, 
denoted by Btff(K), is the compact set K' = sti]{K) uniquely defined by the 
following condition: 

9.13.2. If D is any line orthogonal to H, either K H D = 9 and K' n D = 9, 
or K n D ^ 9 and K' n D is a line segment contained in D whose midpoint 
is D n H and whose length is equal to the length of K n D (in D). 



Figure 9.13.1.1 


Figure 9.13.1.2 
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9.13.3. Remark. Watch out for the fact that st// : K K is not contin¬ 
uous (with respect to the HausdorfT distance). This is seen in figure 9.13.3, 
where the symmetrization of the segment if is a segment (of varying length) 
contained in H, except when if reaches the vertical position; then its sym¬ 
metrization suddenly becomes a vertical segment. 



9.13.4. Proposition. For every H and if xue have 

i) aH(stH(if)) = stH(if); 

ii) C{stn{K)) = w, Steiner symmetrization preserves volumes; 

iii) diam(st//(if)) < diani(if), that is, Steiner symmetrization does not in- 
crease diameters. 

Proof. Property (i) is immediate from the definition; (ii) follows from 9.12.3 
and Pubini’s theorem. For (iii), let x,y €. st^ (if) be such that 

xy = diam(stH(if)), 

and let D,E be lines through x,y and orthogonal to H. By definition, the 
compact set DnK (resp. EnK) has endpoints x', x" (resp. y', y") such that 
x'x" > xaff(x) (resp. y'y" > y<JH(y))\ b™*- we have xy < sup (x'x", j/'y"). □ 

The next lemma follows from 9.13.2 and figure 9.13.5: 

9.13.5. Lemma. Let B be a ball, S its boundary, H a hyperplane containing 
the center of B and if a compact set contained in B. Then 

stH(if)n ((5\if)Ua«(S\if)) = 0. □ 

The next theorem will be put to essential use in 9.13.8 and 12.11.2; it provides 
an explicit way to locate a ball in a family of compacts. 

9.13.6. Theorem (The BLASCHKE KUGELUNGSATZ). Let T be a non¬ 
empty subset of K = K (A^), closed under the Hausdorff measure and invariant 
under Steiner symmetrization with respect to any hyperplane containing a 
fixed point a. Then either {o} ^ J or there exists r > 0 such that B(a,r) G 7. 


266 


Chapter 9. Euclidean aSine spaces 



Observe that when one enlarges a family so as to make it closed and invariant 
under symmetrization, the one-point set {a} can appear in the family, as 
shown by example 9.13.3. 

Proof. Set r = inf { s | there exists F € 7 contained in B(a, s) }, and consider 
the set 7' = /Ca.r+i(-X^) O 7 (cf. 9.11.4). FVom the assumptions and 9.11.4, 
this is a compact set, so there exists F €: 7' such that F C B(a, r). If r = 0, 
we have F = {a}; the proof will be completed by showing that we have 
F = B(a, r) if r > 0. 

9.13.6.1. First step. We prove by contradiction that F D 5 = 5(o,r). 
Assume there exist 6 G 5 and c > 0 such that B(b, e) D F = 0; we shall 
find an element F„ G 7 such that F„ n 5 = 0, contradicting the choice of r. 
Choose points 6,- by induction, so that bi = b and B(bi,e)nB(b i+i,£)n5 ^ 0 
for all i = 1,2.A finite number of such balb covers 5: 

5c Qs(&.,£). 

t = l 
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Figure 9.13.6.1 


Denote by Hi the plane equidistant from 6 and b,-, and define Fi by Induction 
as Fi = F, Fi = st//,(Ji_i) (t = 1,2,. ..,n — 1). Lemma 9.13.5 shows that 
n S = 0, and by assumption F„ S F, as desired. 

9.13.6.2. Second step. We show by contradiction by F = B(o, r). Assume 
X 6 B(a, r) \F; let D be an arbitrary line contuning x, and H the hjrperplane 
orthogonad to D and containing a. Since F C B[a,r), the length of D n F is 
strictly smaller than the length of DnB(a, r); this would imply that stjj(F) 
does not contain S, contradicting the first step-applied to st;/(F) £ F. □ 



9.13.7. Note. There is an older proof, perhaps more intuitive but also 
harder to formalize, that considers a finite number of linearly independent 
hyperplanes Hi passing throngh a and meeting at irrational angles. The 
subgroup of Is„(X) generated by the reflections u//, is dense in Is,i(X), so J 
contains a ball centered at a. 
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9.13.8. Corollary (Bieberbach’s isodiametric inequality). For 
every compact subset K of the n-dimetisional Euclidean space X we have 
(cf. 9.12.4.5) 

J1(K) < 2-"/9(n)(diain(/f))”. 

Proof. If il(/f) = 0, there is nothing to prove; otherwise set 

7 = {GeK \ £(G) > £(K) and diani(G) < diam(X) }. 

By 9.11.5, 9.13.4 and 9.13.6 (where a is arbitrary), either {o} G J or J 
contains a ball B(a, r) with r > 0. The first case is impossible since i!(ff) > 0; 
thus £(B(a,r)) = /9(n)r” > £{K), and 

diam(fl(a,r)) = 2r < diam(/f), 

achieving the demonstration since r > 0. □ 

9.13.9. Remarks. Observe that, in general, a compact set K is not con¬ 
tained in any ball of radius diam(A')/2 (consider an equilateral triangle, for 
instance). 

The isodiametric inequality clearly becomes an equality when K is a. ball. 
The converse is true but much harder to prove; see |EN, 106-107]. 

For applications and generalizations of the Steiner symmetrization, see 
[P-Sj, as well as 12.11. 

9.14. Exercises 

9.14.1. Generalize 9.2.6.5 to the case of two arbitrary subspaces. 

9.14.2. Write equations of the form f[x,y,z) = 0 (in an appropriately chosen 
system of coordinates) for the sets of points in three-dimensional space defined 
by the equation d[x, A) = d[x, B), where A and B are, respectively: two lines, 
a line and a point, a line and a plane, a point and a plane, and two planes. 
Classify the sets according to the terminology of 15.3 and 15.6. What happens 
if we consider the equation d(x,A) = kd[x, B) instead? 

* 9.14.3. Bisectors. Let D, D' be two non-parallel lines in a Euclidean plane 
X‘, show that { X G X I d(x,D] = d(x, D') } is formed by the two bisectors of 
D,D‘. What is this set wheu X is higher-dimensional? (Observe that D and 
D' may not intersect.) 

9.14.4. Let X be a Euclidean affine space. Determine all maps / G GA(X) 
such that /^ = / and / does not increase distance (i.e., f[x)f(y) < xy for all 
x,y e X). 

9.14.5. Let X be three-dimensional and oriented. Define the oriented angle 
of a rotation so as to characterize rotations unambiguously by their oriented 
axis and oriented angle. 
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9.14.6. Let X be three-dimensional and take points (a,) and (aj) (i = 1,2,3) 
in X such that OjOy = ojo' for all = 1,2,3. Assuming the are affine 
independent, show that there exists a unique / G Is'*'(X) (resp. Is~(X)) 
such that /(oi) = a'- for all t. Find a geometric procedure to obtain the 
characteristic elements of / (cf. 9.3.5). 

9.14.7. Continuous screw motions. A continuous screw motion of 
a three-dimensional Euclidean affine space AT is a one-parameter family of 
isometries / : R —* Is(Ar) that can be expressed in an appropriate orthonor¬ 
mal frame as 

/(t) : (x,y,z) I—» (cos kt ■ x + sin kt ■ y, —sin kt ■ x + cos kt ■ y,z + ht), 

where k,h are scalars. Show that, for any map p : R —♦ Is(Ar) of class C* 
(consider Is(Ar) as a subset of GA(Ar) to define differentiability), there exists 
a continuous screw motion / such that (g — /)'(0) = 0. Determine / when g 
is the map defined by the tangent, normal and binormal vectors of a curve. 


Figure 9.14.9 

9.14.8. Show that the isometry / in 9.4.2.3 is never the identity. While at 
it, make a critical study of [GR, 179 ff.]. 

9.14.9. Study billiard trajectories that close after going around k times, 
according to the value of k and the number of sides of the polygon. See a 
beautiful picture in [GR, 176]. 

9.14.10. Make a complete study of least-perimeter polygons inscribed in a 
quadrilateral whose vertices lie on a circle. 

9.14.11. Show that rectangular billiards are never strongly ergodic, but 
always weakly ergodic. Show that an equilateral triangle is not strongly 
ergodic. 


See also [SY, 282). 
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9.14.12. Study the structure of similarities in three dimensions. 

9.14.13. In dimension two, take four points a,a',b,b' such that a ^ b and 
a' ^ b' . Show that there exists a unique / G Sim'"(X) such that f(a) = o' 
and f{b) = b'. Construct its center and axis geometrically. 

9.14.14. Use 9.7.6.5 to find a geometric method different from 9.6.2 for the 
construction of the map / G Sim'*’ (AT) (resp. Sim“(Jf)) such that /(o) = o' 
and /(b) = b'. 

9.14.15. Harmonic QUADRILATERAL. Let y, a, t be points in a Euclidean 
plane such that [x,y,z,t\ = — 1 (cf. 9.6.5.2). Let o (resp. 6) be the midpoint 
of X and y (resp. z and t). Show that x,y,z,t are cocyclic, that the pole of 
(x, y) lies in (z, t), that (x, y) is the bisector of (o, z) and (o, t), that az ■ at = 
ax^ = oy^, that xy ■ zt — xz ■ yt + xt ■ yz and. that az + at = bx + by. Show 
that, if u is the intersection of {x,y) and (z,t), we have 



State and prove converses for these statements. Show that inversions (cf. 10.8) 
transform harmonic quadrilaterals into harmonic quadrilaterals. 



* 9.14.16. Inscribe a square inside a given triangle (figure 9.14.16). 

9.14.17. Discuss the form and relative position of the Pascal limaQons in 
figure 9.6.8.0.4, according to the relative position of the two circles. 

* 9.14.18. Study the convexity characteristics of limaQons (figure 9.6.8.0.3) as 

the position of the point a varies with respect to the circle. (Hint; use the 
formula + 2p'^ — pp", which gives the concavity of the curve p(d) in polar 

coordinates.) 
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Figure 9.14.16 

9.14.19. Let C,C' be two subsets of the Euclidean plane X such that, for 
any a, there exists a similarity / of angle a taking C into C. What can be 
said about C and C*? 

9.14.20. Are there any differentiable group homomorphisms R —» Sim,|J^(Ar) 
other than the ones in 9.6.9.1? 

9.14.21. Logarithmic spirals. What are the plane curves whose tangent 
makes a constant angle with the line joining the foot of the tangent to a 
constant point? What are the plane curves whose radius of curvature is 
proportional to arclength? Can a logarithmic spiral coincide with its evolute? 

9.14.22. Pascal limaqons, Descartes ovals and stigmatic diop¬ 
ters. Let u and v be points and S a surface of revolution around the 
axis (u, v). Show that a necessary condition for S, considered as a diopter 

satisfying Descartes’s law = constant, to be perfectly stigmatic for the 
two points u, V is that the points x of S satisfy a relation a ■ xu + b ■ xv — c, 
for some a, 6, c € R. A Descartes ova] is a plane curve of the form { a: £ A" | 
a ■ XU b ■ XV = c}, where u,v € X and a,b,c £ R are given. Show that 
Pascal limagons are Descartes ovals. Study the shape of Descartes ovals. For 
more details, see [BP]. 



9.14.23. CaYLEY-MENGER DETERMINANTS. Show that P satisfies the 
following relation, where M,fe is the cofactor of the entry in F; 

F( 2 ;(), . . . , |)r(x(), . . . , X,' , . . . , Xfc ) 

= Mffc -I- r(xo,..., Xi, ..., Xfc_i)r(xo,. .., Xfc). 
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Deduce that, if r(a:o, • • • i ®fc) is non-zero and has sign (—all the subde¬ 
terminants r(a:i,. Xii^) (ii < ■ ■ ■ < ih, h = 2 . k) are non-zero and have 

sign (—1)^‘. Show that a necessary and sufficient condition for the existence 

of points (a:i)i_o.i. k in X such that XiXj = dij for i, j = 0,1.fc is that 

all the determinants r(a:,-,. Xi^) (tj < < ihih = 2,...,fc) either be 

zero or have sign (—1)^. 

9.14.24. Give a simple criterion to guarantee that the case {a} £ 7 does not 
occur in 9.13.6. 

* 9.14.25. Sylvester’s THEOREM. Prove that if a set of n points (a:i)i=i.„ 

on an affine plane has the property that every straight line containing two 
of the Xi also contains a third, then all the points are on the same line. 
(Hint: introduce an arbitrary Euclidean structure and consider a triple of non- 
collinear points such that the distance from one to the line passing through 
the other two is minimal in the set of all such distances.) 

9.14.26. Find / G Is(Ar) and a bounded subset A of X such that f(A) C A 
but f{A) / A. Show that if A is compact f(A) C A implies f(A) = A. 

9.14.27. Show that Is^(Ar) C Isj(Ar). Find an example where the two 
groups are different. 

9.14.28. Find examples of unbounded (resp. infinite) sets A such that Isyt(Ar) 
is compact (resp. finite). 

9.14.29. Plane lattices. A httke of is a subset of the form Zi-t- Zy, 
where a, y are linearly independent (cf. 1.7.5.2). Show that every lattice A of 
R2 is similar to a unique lattice of the form Zu -t- Zv, where u = (1,0) and v 
belongs to the region 

P = { (a, 6) I 0 < a < ^, -t- > 1, fc > 0 }. 

Study Is>i(R^) according to the position of u in P. Show that two lattices A 
and A! of R^ satisfying 

#{ a: e A 1 l|x|| = r } = #{ I e A' 1 ||z|| = r } 

for all r £ R.). are necessarily isometric. This is not true for lattices in higher 
dimension; see [SE2, 177, lines 4-8). 
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9.14.30. Intrinsic metrics. Prove 9.9.7.1. Study the metric d, where 
(M, d) is the following metric space: d is the distance induced from X on the 
cone M with vertex O and containing the curve given in 9.9.3.3 (drawn on ? 
plane P not passing through O). Find an example to show that if (M,d) ii 
such that d is still a distance, the topologies of [M,d) and (M,d) may differ. 

9.14.31. MENGER curvature. Let / be a regular curve of class in a 
Euclidean space. Let z, y, z be distinct points of /, and set 

y/(xy + j/z + zx)(xy + yz - zx)(xy - yz + zx)(xy - yz + zx) 

— xy yz ■ zx 

(cf. 10.3.4). Show that, as y and z approach x on /, (z, y, z) tends towards 

the curvature of / at z (see [B-G, 8.4j, for example). Find examples of regular 
curves of class for which K(x,y,z) does not have a limit, or becomes 
infinite, as y and z approach z. 

9.14.32. (This exercise is optional, as the author cannot take upon himself 
the costs involved.) Visit the Basel cathedral and decide whether the curve 
engraved on Bernoulli’s tomb is really a logarithmic spiral. 

* 9.14.33. Let C be a plane curve in a Euclidean plane, of class and strictly 
convex. Show that for any integer n > 3 there is at least one n-sided light 
polygon inscribed in C. 

9.14.34. HYPOCYCLOIDS AND EPICYCLOIDS (a.k.a. Spirograph). 

9.14.34.1. Definition. A hypocycloid (resp. epicycloid) is the set C of 
points of the Euclidean plane occupied by a given point on a circle F' that 
rolls (without sliding) outside (resp. inside) a fixed circle F of commensurable 
radius (figure 9.14.34.1). Study the shape of C according to the ratio between 
the radii. How many cusps does C have? 

9.14.34.2. Equivalent definitions. Let E be the unit circle of R* = C, 
and let r ^ 0 be rational. Show that the envelope of the lines D{0) joining 
the points e’*' and e”'*', for 5 e R, is a hypo- or epicycloid. Discuss what kind 
it is and how many cusps it has, according to the value of r. Are all hypo- 
and epicycloids obtained in this way? 

9.14.34.3. Examples. 

A) What does the figure look like when F' has half the radius of F and rolk 
inside F? (It is called Lahire’s cogwheel.) 

B) Show that, if F' and F have the same radius and F' rolls outside F, one 
obtains a Pascal limagon, called a cardioid. Let a and D be a point and a 
line in the plane, with a^D\ show that the envelope of the family of lines 0 
defined by m G 0 and Z>0 = 3ma, D, for m G D, is a cardioid, whose center 
is, by definition, the point a. 

C) Show that the caustic of a plane spheric mirror (that is, the envelope 
of the light rays reflected from a beam parallel to the axis) is a piece of a 
two-cusped epicycloid (called a nephroid). 
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Figure 9.14.34.3.C 



Figure 9.14.34.3.D 
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D) Let T be a triangle. Show that the envelope of the Simson lines of T 
(cf. 10.4.5.5, 10.9.7.1) drawn from a point that describes the circle circum¬ 
scribed to T is a three-cusped hypocycloid. Show that the circle defining 
this hypocycloid is tangent to the nine-point circle of T (cf. 10.11.3) at three 
points; determine these three contact points. 

E) Show that horizontal projections of spherica.1 helices with a vertical axis 
(that is, curves on the sphere whose tangent makes a constant angle a with 
the axis) are epicycloids for appropriate values of a. 




F) Show that the envelope of a segment of fixed length whose endpoints 
describe two orthogonal lines is afour-cusped hypocycloid (called an astroJd). 

9.14.34.4. Properties. Show that the evolute of a hypo- or epicycloid is 
similar to the original curve. Show that the arclength s (computed from an 
appropriated starting point) and the curvature /f of a hypo- or epicycloid 
satisfy a relation of the form 

as^ -f- bK~^ = c, 

for a,b,c constant (this is called the intrinsic equation, cf. [B-G, 8.5.7)). 
Conversely, what are the curves satisfying such a relation? Find the total 
length of a hypo- or epicycloid. 

9.14.34.5. Cardioids and Morley’s theorem. Let T be a triangle. Show 
that the centers of all cardioids tangent to the three sides of T form 27 
lines, lying in three directions making an angle of 2n'/3 (use 9.14.33.3.B and 
10.13.18). Show that the sides of the Morley triangle of T (cf. 10.3.10) lie 
on three of those lines (notice that each vertex of the Morley triangle is the 
center of a cardioid lying inside T and tangent to all its sides, one of them 
twice.) See also 10.13.23. 

9.14.34.6. For more information on hypo- and epicycloids, see [LMl]. See 
also |ZR], a very agreeable text on plane curves au their links with mechanics, 
optics and electricity; in particular, see chapter XXI, where the connection 
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between epicycloids and cogwheel design is discussed. Finally, [LF-AR, 413- 
435] gives an analytic presentation of the various cycloids, including, in the 
last three pages, a derivation of the shape of the carter of the Wcinkel engine 
(cf. 12.10.5). 

9.14.35. Let a,b,c,d be points in a Euclidean plane, and assume b,c,d to be 
collinear. Prove the Stewart relation 

(ofc)^ cd + (ac)^ db + (ad)^ be + be ■ cd ■ db — 0, 

where the bars stand for signed distances on the line containing b,c,d. 

9.14.36. Let X be a Euclidean affine space and G a finite subgroup of Is(X). 
Show that there exist subsets A of AT such that G = Isy 4 (Ar). Deduce that, 
for any finite group (7, there exists a Euclidean affine space X and a subset 
B of X such that G is isomorphic to Isi 3 (Ar). 

9.14.37. Let ^ be a map from a Euclidean plane E into itself that “preserves 
the distance 1”, that is, such that d{^4>{x),d>(y)) — 1 whenever d{x,y) = 1, 
for any x,y E E. Show that E is an isometry. (Hint: Show first that <fi 
preserves the distance \/3i a.nd use the fact that Z “f Z'\/3 is dense in R). See 
[M-P, 152|. 

9.14.38. Show that a necessary and sufficient condition for three lines D, E, F 
of a three-dimensional Euclidean space to have a common perpendicular 
(cf. 9.2.6.5) is that the composition od oap of the reflections through 
these lines also be a reflection through a line. Deduce the following result, due 
to Petersen and Morley: If X,Y,Z are three lines in Euclidean three-space, 
X' (resp. Y', Z') is the common perpendicular to Y and Z (resp. Z and X, 
X and Y) and X" (resp. Y”, Z") is the common perpendicular to X and X' 
(resp. Y and K', Z and Z'), the three lines X", Y" and Z" have a common 
perpendicular. See other proofs in [LF-AR, 681] and |FL, 339]. 

9.14.39. Given six points a,, a( (t = 1,2,3) of satisfying d(oj,oy) = 
d(a|.,a' ), find a geometric construction for the isometries / taking a< into a\ 
for all i. 

* 9.14.40. Given four points a,b,a',b' in R^, construct the centers of the 
similarities taking a to a' and b to 6'. 

9.14.41. Given six points a,,a' (i = 1,2,3) of R^, give necessary and suf¬ 
ficient conditions for the existence of a similarity / taking a, to a' for all i. 
Assuming those conditions are satisfied, find a geometric construction for the 
center of the similarities /. Extend to arbitrary dimension. 

9.14.42. Given three circles in a Euclidean plane, study the figure formed by 
the centers of the homotheties taking one of the circles into another. Same 
study for four spheres in three-space. Same question for similarities. 

9.14.43. Take a plane triangle, and consider the three equilateral triangles 
built on its sides and outside it. Show that the centers of these three triangles 
form an equilateral triangle. 
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9.14.44. (Napoleon’s theorem). Take a parallelogram and consider the 
four squares built on its sides and outside it. Show that the centers of the 
squares form a square. 



Chapter 10 

Triangles, spheres and circles 


This chapter deals with the domestic animals of geometry: 
triangles, polygons, tetrahedra, circles, spheres. We have col¬ 
lected here basic definitions and results, but also more intricate 
results that have a simple statement—or illustration. A presen¬ 
tation for this chapter seems unnecessary; the reader who leafs 
through it will glean from it whatever strikes his fancy, and will 
turn back when necessary for enlightenment on notation or facts 
used in the proofs. In any case, we do not tarry overlong on 
the proof of cla.ssical results, as they are likely to be familiar to 
the reader, and the purpose of the chapter is just to provide a 
number of illustrations, exercises and problems on the material 
covered in chapter 9. Section 10.12 serves as introduction and 
motivation to material to be covered later. 

See [COO] for more on these domestic animals. 
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All spaces considered are Euclidean affine. The dimension is two in sections 
10.1 to 10.5 and 10.9 to 10.11, and three in section 10.12. 


10.1. Triangles: definitions and notation 


10.1.1. According to 2.4.7, a triangle consists of three affine independent 
points x,y,z. By restricting to the affine plane spanned by the three points, 
we can assume that we’re working in a Euclidean plane X\ we do so in sections 
10.1 through 10.4. 



Figure 10.1.1 


10.1.2. Let T == {x,y,z} be a fixed triangle. We shall make consistent use 
of the following notation: 


a = yz, b = zx, c = xy (cf. 9.1.1), 

A = xy, xz, B = jjsc, yz, C = iy, zx (cf. 8.6). 

The points x, y, z are called vertices. The sides of T are either the segments 
joining each pair or vertices (cf. 2.4.7) or their lengths o, fc, c (or yet, by a 
harmless abuse of language, the lines supporting them). The angles of T are 
A,B,C, real numbers in ]0, ir[. 

10.1.3. A triangle is called isosceles if two of its sides are equal, equilateral 
if all three are. It is called^a right triangle if one of its angles is n/2, an acute 
triangle if all its angles are in ]0,x/2|, and an obtuse triangle otherwise. 


10.1.4. The altitudes of T are the lines containing one vertex and perpendic¬ 
ular to the opposite side; the length of the corresponding segments is denoted 

by h„, hi,, he (figure 10.1.4.1). The medians of T are the lines ^ ^ 

/z-|-z\/z-f-y\ 

f y, —), f z, —-— ), and the lengths of the corresponding segments are 
denoted by me (figure 10.1.4.2). The interior bisectors of T are the 


lines containing the bisectors of the oriented lines (xy, xz), (yz, yz), (zz, zy); 
the exterior bisectors are the lines perpendicular to the interior bisectors at 
the corresponding vertices. The length of the segments determined by interior 
and exterior bisectors is denoted by ia,ib,‘ic and ea,e(„ec (figure 10.1.4.3). 
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Figure 10.1.4.3 


10.1.5. By 9.7.5, there exists a unique circle circumscribed around T, whose 
radius we denote by R (figure 10.1.5.1). There exist four circles tangent to 
the three sides of T; one inside the triangle, called the inscribed circle, and 
three outside. Their radii are denoted by r, r„,r;,,r,. (figure 10.1.5.2). Finally, 
5 stands for the area of T (cf. 9.12.4). 



Figure 10.1.5.1 
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Figure 10.1.5.2 

10.2. Classical results 

10.2.1. Existence. For there to be a triangle {x,y,z} with sides xy = c, 
yz — a and zx = 6, it is necessary and sufficient that x, y, and z satisfy the 
strict triangle inequalities 

a<6 + c, 6<c + o, c < a + b, 

which can be summarized as 

[6 — c| < o < fc + c. 

In fact, formula 10.3.1 shows that it is enough to find A € [0,5r[ such that 
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and this can be done if and only if this fraction is in ]—l,l[. For another 
proof, use 9.7.3.4 and 9.7.3.8. 

10.2.2. Isosceles triangles. For the triangle T to be isosceles, it is nec¬ 
essary and sufRcient that two of its angles be equal. This follows from formula 
10.3.1; necessity is trivial, and for sufficiency, observe that the condition 

b^ + c^~ H- tt^ - 9 

2bc 2ca 

can be written as (a — 6) ((a -I- b)^ — c^) = 0, and 10.2.1 forces o = fc. It 
follows that a triangle is equilateral if and only if its three angles are equal; 
they must have the value 7r/3 by 10.2.4 or 10.3.1. 

10.2.3. Pythagorean theorem. A necessary and sufficient condition for 
T to have a right angle at x is that = b^ + (cf. 9.2.3 or 10.3.1). 

10.2.4. Sum of the angles. One always has A -t- B -t- (7 = jr. In fact, we 
can apply 8.7.5.3; xz is between xy and yz, and similarly j/z is between xz 
and yx, so we have 


xy, xz + xz, yz yz, yx = xy, yx = n. 

We shall encounter geometries where the sum of the angles of a triangle is 
always greater than n (cf. 18.3.8.4) or less than n (cf. 19.5.4). 



10.2.5. Concurrence. Thereader may have remarked in figures 10.1.4 and 
10.1.5 that the altitudes are concurrent, and so are the medians, the interior 
bisectors (or one interior and two exterior bisectors), and the perpendicular 
bisectors of the sides (9.7.5). This follows from 3.4.10 for the mediums, from 
9.14.3 for the bisectors and from 9.7.5 for the perpendicular bisectors. Only 
the altitudes require more thought; one solution is to consider the triangle T' 
whose sides are the parallels to the sides of T passing through the opposite 
vertex; the altitudes of T become the perpendicular bisectors of the sides of 
T'. See also 10.13.1 and 17.5.4. The intersection point of the altitudes of T 
is called the orthocenter of T. 
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10.2.6. Congruence. Let T, T' be triangles whose elements (cf. 10.1.2) we 
denote by unprimed and primed letters, respectively. The following conditions 
are equivalent: 

i) there exists / G Is(Ar) such that f{x) = x', f(y) = y\ f(z) = z'\ 

ii) a = a', b = b\ c = c'; 

iii) A = A\b = b',c = c'; 

iv) A = A\ B = B', c = c'. 

This follows from 9.7.1 and the formulary below. Watch out, though, 
when using the formulary: the sine of an angle is not enough to uniquely 
determine the angle in the interval ]0, xj, though equality between cosines 
can be used to show equality between angles. 

Triangles that satisfy the conditions above are called congruent. 

10.2.7. Similarity. With the same notation, the following conditions are 
equivalent: 

i) there exists / € Sim(Jf) such that f{x) — x', f(y) = y', f(z) = z'\ 

... a' b' c' 

" - = r = 

a b c 

iii) A = A', B = B',C = C. 

Use 9.5.3.1 and the formulary to see that (i) =>■ (iii) => (ii). To see that 
(ii) (i), use a similarity of ratio a'fa to reduce the problem to congruent 
triangles. 


10.3. Formulary 

In the formulary below we use the notation introduced in 10.1, plus the 
symbol p for the haJf-periiiieter of T, defined as p = (o + fc + c)/2. The 
triangle {x,y,z} is arbitrary. 


lO.S.l 


cos A — 


62 + c2 - a2 
26c 


sinA= ^\/p(p-o)(p-6)(p-c) 


. A 

Sin — = 
2 


(p-6)(p-c) 

be 
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10.3.2 -7^ = - 2R 

sin A sm B sin c 

10.3.3. (Heron’s formula) 


s = -aha - -be sin A = pr = \/p(p- o)(p - fc)(p - c) 


10.3.4 


R = 


abc 

\/(a + fc + c)(a + fc — c)(o — fc + c)(—o + fc + c) 


10.3.5 



10.3.6 


b + c 


v/p(p - a)bc. 


|fc-, 


-y/(p-b)(p- c)bc 


10.3.7 


2(fc^ + c^) - tt^ 
4 


The first formula in 10.3.1 follow s from 8.1. 2.4 by applying 8.6.3 and 9.1.1. 
The second comes from sin A = v 1 — cos^ A and the third from 


. 2 A 1 — cos A 

sm — =---. 

2 2 

For 10.3.2, let w be the center of the circle circumscribed around T (figure 
10.3.2). By 10.9.3, the triangle wyz has angle 2A at w; by definition of sine, 
a = i/JsinA. If you don’t want to resort to 10.9.3, use 9.7.3.7, which also 
gives 10.3.4 and the last equality in 10.3.3. 



The first equality in 10.3.3 is just 9.12.4.4, and the second comes from 
hi, = c sin A. The third is obtained by dividing T into three triangles; 

T = {a:,p,a} U {y,z,a} U {z,x,a}, 
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Figure 10.3.3.1 Figure 10.3.3.2 


where a is the center of the triangle inscribed in T (figure 10.3.3.2). Since 
each of these has altitude r and the opposite sides are a, b, c, the equality 
follows from 9.12.4.4. The last equality is a consequence of 10.3.1. 

Elquality 10.3.4 could be obtained from 9.7.3.7, as mentioned; but it also 
follows directly from 10.3.1 and 10.3.2. 

For 10.3.5 we use the same trick that shows S = pr, choosing the center 
a' of the exinscribed circle (figure 10.3.5). Here T U {y,z,a') = (a:, y,Qi'} U 
{x,z, a'}, so 

area(T) = area({a:, y, a'}) + area({x, 2 , a'}) — area({y, 2 , a'}). 

The three triangles have altitude and sides b,c,a, respectively. 



Figure 10.3.5 


Formula 10.3.7 is just 9.7.6.5. As for 10.3.6, we leave e„ to the reader, 
and find i„. Let s be the point where the bisector from x intersects the side 
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[y,z]. Applying 10.3.2 and observing that the angles at s of the triangles 
{s, j/, z} and {s,z,x} have the same sine, we get 


10.3.8 


We now apply 10.3.2 


sy sz sy + sz a 
b c 6+c b + c 

to the triangle {x,s,z}, for example, and 10.3.1 twice. 


X 



10.3.9. Note. It is a good exercise to find out what these formulas become 
for right triangles. 

10.3.10. Application; MorlEY’S theorem. This is an apparently 
simple result, but its proof is not obvious; the reader is encouraged to attempt 
a geometric proof, or even a trigonometric one, before reading what follows, so 
as to appreciate its difficulty. Exercises 10.13.4 and 9.14.34.5 give geometric 
proofs; 10.13.23 is a related exercise. See |LB1, 173-194] for the figure formed 
by all the trisectors of a triangle, and |CAL| for a generalization to polygons. 


X 



Figure 10.3.10 
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Take a triangle {x,y,z} and trisect its angles; let p (resp. q, r) be the 
point where the trisectors adjacent to side o (resp. fc, c) intersect. Morley’s 
theorem says that the triangle {p,q,r} is equilateral. 

Set Q = A/3, P = 3/3, 7 — C/3, and apply 10.3.2 to {x,y,z} and 
{z,y,r}. We obtain 

sin /9sin(3a 

TX — 2R -:—;- - - 1 

sin(Q + P) 

using 10.2.4 also. A trigonometric calculation (cf. 8.7.8 and 8.12.8) using 
a + P + 'I = n/3 leads to 

rx = 8 i?sin/9sin 7 sin (if/3 + 7 ). 

Similarly, 

qx = 8 A sinsin 7 sin (‘if/3 + P), 

so that 

rx qx 

— -I—r -T = ^= 8 /? sin 5 sin 7 ; 

sin (k/3 + 7 ) sin (ir/3 + P) 

but since (ir/3 + 7 ) + (jr/3 + P)+ct = Jr, formula 10.3.2, applied to a triangle of 
angles ( 5 r /3 + 7 ), ( 7 r /3 + P) and a and one side equal to rx, shows that such a 
triangle is congruent to {z, r, 9 }. Then 10.3.2 implies rq = 3R sin a sin P sin 7 , 
and this must also be the value of qp and pr. 

10.3.11. The geometry of the triangle. There is an immense number 
of results on triangles; see the wonderful book |COO], and also |R-C, volume 
I, note III]. Some examples are given in 10.11. 

10.4. Inequalities and problems of minimum 


10 . 4 . 1 . The ISOPERIMETRIC inequality. For every triangle we have 
S < p^/3\/3, and equality holds if and only if the triangle is equilateral. 
Proof. It is enough to apply 10.3.3 and 11.8.11.6 to obtain 

(P-p)(P-H(P-C) < 

equality is satisfied if and only ifp — o=:p — fc = p — c. □ 

10.4.2. Note. We shall encounter two generalizations of 10.4.1, one in 
10.5 and one in 12.11.1. As for cui “isodiametric” inequality for triangles 
(cf. 9.13.8), it follows immediately from 10.3.3, since the diameter of T is 
sup {o, b, c). 

10.4.3. Fermat’S problem. From 9.7.6.3 we deduce that 

tx^ + ty^ + tz^ > gx"^ + gy^ + gz^, 

where g = (z + p + z)/3is the center of mass of T and z is arbitrary. Equality 
holds only when t = g. The value of the minimum follows from 10.3.7 and 
3.4.10. 
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Finding the minimum of the function t tx + ty + tz is, surprisingly 
enough, a much more complicated problem (the reason being that the distance 
function is not bilinear, whereas its square is); this is the so-called Fermat 
problem. The solution is indicated by 9.10,6; we are looking for points t of X 
from which the three sides of T are seen under an angle 25r/3. More precisely, 
either some angle of T, say A, is > 2v/3, and then the minimum is achieved 
for t = X only; or all the angles of T are < 25r/3, and then there exists a 
unique point w G AT from which the three sides of T subtend an angle 2rc/i, 
and the minimum is attained for t = w only. 

To begin with, the point t where the minimum is reached must be in the 
closed triangle, for otherwise the foot u of the perpendicular dropped from t 
onto a side {y, z) of T such that t and x are on different half-planes would 
satisfy tx -i- ty -i- tz > ux uy -i- uz (apply 9.2.2). 

If A > 27r/3, for every t ^ x in the interior of T, the angle at t of 
{t, y,z} is > 2rr/3, and t cannot minimize the sum. Since there is a minimum 
by compactness, it must be at x. 



Now assume that all three augles of T are strictly less than 2jr/3. Assume 
X oriented, and let / be the rotation with center y and angle 7r/3. Set 
x' = f{x), t' = f(t) for t € X arbitrary. Then 

tx -i- ty -i- tz = x't' -|- t't + tz > x'z, 

and equality holds if and only if x',t',t,z are aligned and in the right order. 
Next we construct z" = /“'(z); since the angles if T are < 25r/3, the lines 
{x,z") and {z,x') intersect at a single point w, such that = f{u)),u),z 

are aligned and in the right order. Thus w satisfies ujx -|- u)y + u)z = x'z, 
concluding the proof. 

This incidentally shows that the three lines (x',z), {x,z") and (y, x") in 
figure 10.4.3.2 intersect in the same point, forming three angles 5r/3. 
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10.4.4. Note. We have studied in section 9.4 the problem of least-perimeter 
triangles inscribed in T. 

10.4.5. Area of pedal triangles. If T = {x,y,z) is a triangle and 
I € X a point, we define the pecfa/ triangle of t as the triangle {p, g, r} formed 
by the orthogonal projections of t onto the sides of T (this is an abuse of 
language, for p,q,r can be collinear). We are mterested in the area of pedal 
triangles, for example, for what position of t inside T b the area maximized? 
To do this we need to introduce oriented triangles and their area. 

10.4.5.1. Definition. An oriented triangle in an oriented plane X is a 
triple {x,y,z) of points of X. The (signed) area of (x,y,z) is A(x,y,z) = 
^X^(xy,^) (cf. 8.11.3). 

It is immediately seen that A is invariant under cyclic permutations and that 

10.4.5.2 A(x,y,z) = A(t,x,y) + A(t,y,z) + A(t,z,x). 

Further, if o € Is“(Jf), we have A(cr(a:), (r(p), cr(z)) = -A{x,y,z). 

Now back to our triangle T and our point t in the plane X, assumed ori¬ 
ented. In addition to p,g,r, we introduce the reflections p‘,q',r' of t through 
the sides of T. Then, if T has positive area, 

10.4.5.3 A{p, q,r) = -A(x,y,z) -(sin 2A ■ tx^ + sin 2B ■ ty~ + sin 2C ■ tz^). 

2 8 

Proof. Refer to figure 10.4.5.3, where one possible relative position for the 
points is given; the others are treated analogously. The idea is to decompose 
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Figure 10.4.S.3 Figure 10.4.5.4.1 


the hexagon {x,r',y,p', z,q'} into triangles in two different ways; {a:, j/, f}, 
{y,z,t} , {y,z,p'}, {z,x,t}, {z,x,g'} on the one hand, {p',g',r'}, 
{x,r',q'}, {y,p',r'}, {z,q',p'} on the other. In algebraic terms we have 

2A(x,y,z) = 2A(t,x,y) + 2A{t,y,z) + 2A(t,z,x) 

= A(^,^,y) - A{r,x,y)+ A(t,y,z) 

- Mp',y,^) + ~ ■^( 9 '.^.®) 

= A{p',q',r') + >l(a;,r',g') + A{y,p',r') + A{z,q',p’). 

But A{p',q',r') = 4A(p,q,r) because {p',q',r'} can be derived from {p,g,r) 
by a homothety of center t and ratio 2, and 

A(x, r , q') = - sin 2A ■ xr ■ xq' = - sin 2A • tx^ 

from 10.3.3 and 8.7.7.8; this implies 10.4.5.3. □ 

By 10.4.5.3 and 9.7.6 the least-area problem will be solved if we know the 
barycenter of the punctual masses (sin 2.4,z), (sin2B, p), (sin2C,z). But this 
is none other than the center u) of the circle circumscribed around T; it is 
enough to check that 

sin 2 A ■ wx -f- sin 2B ■ uJy sin 2C • uJl = 0 
(cf. 3.4.6.5), and this follows from the proof of 10.3.2. 

10.4.5.4. Without delving too deep into the problem, we can still extract 
some conclusions from 9.7.6.2; for certain values of k, the locus of the points 
of X for which the pedal triangle with respect to T has (unsigned) area k is 
made up of two circles of center w (figure 10.4.5.4.2); for other values of k, of 
one circle only. The following unexpected fact comes up as a bonus (figure 
10 . 4 . 5 . 4 . 1 ): the locus of the points of X such that p,q,r are collinear is the 
circle circumscribed around T (the “Simson line”, see 10.9.7.1). In fact, p,q,r 
collinear means A{p, q, r) = 0, so the locus is a circle of center ai; but this circle 
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must go through x, y,z, which satisfy the zero area condition; this proves the 
assertion. We also get the formula 45 = /?^(sin 2 A + sin 2B + sin 2C). 

10.4.5.5. We mention that, as x describes the circle circumscribed around 
T, the Simpson line envelopes a three-cusped hypocycloid (9.14.34.3.D), thus 
revealing a symmetry of order three in any triangle. 

10.4.6. The theorem of Erdos-MoRDELL. The next theorem has an 
interesting history: it was conjectured in 1935 by Erdos and proved in 1937 
by Mordell, but the methods used were not at all elementary, in spite of 
the simplicity of the statement. It was only in 1945 that D. K. Ka.zarinoff 
gave au elementary demonstration. Here we present the theorem’s statement 
only; the reader can amuse himself by tackling the demonstration, or read 
the whole story in [KF, 78] or [FTl, 12). 

10.4.7. Theorem. Let T be a triangle and t a point in the interior of T ■ 
For p, q, r as above, we have 

tx + ty -f- tz > 2(tp + tg + tr), 

and equality holds if and only if T is equilateral. 

10.4.8. Notes. For other inequalities conceriiin-g triangles, see 10.13.5 
and [GSj. 
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10.5. Polygons 

We just give a few references on this subject. In 9.4 we have already 
studied convex polygonal billiards. For a polygonal analogue of 10.4.3, see 
10.13.8. 

10.5.1. ISOPERIMETRIC INEQUALITY FOR POLYGONS. The generalization 
of 10.4.1 is the following: if .P is a convex n-sided polygon with area S and 
perimeter P, we have 

g < _1_p2 

— 4ntan(3r/n) ’ 

and equality holds if and only if P is regular. For a proof and refinements, 
see 12.12.15, [FTl, 8-11] or [GR, 192). 

10.5.2. If P is an n-sided convex polygon, the sum of the angles of P is equal 
to (n— 2)ir (cf. 12.1.12). 

To see thb, use the fact that the polygon is convex to divide it into n — 2 
triangles, starting from an arbitrary vertex. Applying 10.2.4 to each of the 
small triangles and adding up, we find that the sum of all the angles of P is 
(n — 2)5r. 



Figure 10.5.2 
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10.6. Tetrahedra 


We won’t talk a lot about tetrahedra, just enough to remark that there 
are things about them that have no counterpart for triangles. We discuss 
in detail one phenomenon peculiar to tetrahedra, arising from the study of 
spheres tangent to their four faces; we also mention two other particularities 
not shared with triangles. This is all done from 10.6.6 on, while 10.6.1 through 
10.6.5 present generalizations of results already studied for triangles. 

For references on the geometry of tetrahedra, the reader can consult 
[COO], [C-B] and [R-C, volume II, note IV]. (Incidentally, as can be seen 
from the preface of [C-B], already in 1935 there were teachers complaining 
that the geometric proficiency of students was declining. I’m offering a prize 
to anyone who can find an Egyptian tablet containing the same plaint...) 

10.6.1. Let T = {z, y, z, t} be a tetrahedron (cf. 2.4.7). In this whole section, 
we shall assume that T is in three-space. An edge of T is both a line like (z, y) 
and a distance like xy, similarly, a face means either a plane like {x,y,z), a 
triangle like {z, ^,z} considered in this plane, or the area of this triangle in 
this plane (cf. 10.1.5). The volume of T is denoted by V (cf. 9.12.4), and the 
radius of the sphere circumscribed around T by R (cf. 9.7.5). 

10.6.2. By 9.7.1, a tetrahedron is defined, up to isometry, by its edges. For 
the existence of a triangle with given edges, it is sufficient that the strict 
triangle inequality (10.2.1) be satisfied for each face, and that ?(,,,) be 
strictly positive (9.7.3.4). In fact (cf. 9.14.23), this latter condition implies 
the strict triangle inequalities. 

10.6.3. Medians. From 3.4.10 we deduce that the four medians of T, that 
is, the segments connecting a vertex to the center of mass of the opposite 
face, intersect. The length of the medians can be found by applying 9.7.6. 

10.6.4. Discussing angles at the vertices of a tetrahedron is more difficult 
and awkward, involving ideas we won’t discuss until chapter 18. It is easier 
to introduce the dihedral angles between faces, but we don’t do this here; see 
10.13.10. 


10.6.5. From 9.7.3.3, 9.7.3.7 and 9.12.4.3 we deduce V and i? as a function 
of the edges; 
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r(a:, y,z,t). 


1 T(x,y,z,t) 

2 A(z,y,z,f)' 


10.6.6. Altitudes. The altitudes of a tetrahedron, that is, the lines going 
through one vertex and orthogonal to the opposite face, do not intersect in 
general. See [C-B, 121] and [R-C, volume II, p. 643]. 


10.6.7. MOBIUS TETRAHEDRA. There exist pairs of tetrahedra such that 
each vertex of oue belongs to a face of the other. This phenomenon can 
be elegantly explained in the context of projective geometry, cf. 4.9.12 and 
14.8.12.4. 
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10 . 6 . 8 . Roofs and their spheres. We consider a tetrahedron T = 
{zi, Z 2 i 2 : 3 , 2 : 4 } and search for spheres S tangent to the four faces of T, that 
is, such that their center w is equidistant from the faces of T. A tempting 
analogy with figure 10.1.5 would lead us to believe that there exist five such 
spheres, one interior to T and four inside the frusta determined by T (see 
figure 10.6.8). 


the interior a frustum a trihedron a pair of opposite roofs 



Figure 10.6.8 


A more accurate analysis shows these are not the only possibilities. We 
can start by using formula 9.2.6.4; let /, (t = 1,2,3,4) be affine forms on X 
whose kernels are each a face of T. We can assume that ||/,-|| = 1 for aD t. 
Then w must satisfy 

|/l(w)| = |/2(w)| = |/3(w)| = \hHl 

which k equivalent to eight equalities between affine forms. By 2.4.8.5, thk 
gives us eight solutions in the “generic” case, that is, when the /, are linearly 
independent. Checking independence k non-trivial; we tackle the problem by 
introducing barycentric coordinates with respect to T. In such coordinates, 
the following lemma holds in ciny dimension: 

10.6.8.1. Lemma. Let (z,),=i.„ 4 .i be a simplex of X, and set 

Hi (X4 , . . . , X4 , . . . , X,, X ) 

(the i-th face of the simplex). Let V be the volume of (xv),=;i.„ 4 .i and o,- 

the volume of the simplex (xyjyj^i in Hi. Then the barycentric coordinates 
(A, ) of a point x S X with respect to this simplex are given by 

|A.|= ^d(x,//.) 

and we have A,- > 0 (resp. <0) if x and Xj lie in the same half-space (resp. in 
opposite half-spaces) with respect to Hi (cf. 2.7.3). 
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Proof. Let /,■ be an afRne form such that ||/i|| = 1, with H{ — f- ’(0) and 
fi(x{) > 0. By 9.2.6.4, we have 







In particular, taking x = Xi, we have d(xi,Hi) = /t( 2 :,), which is equal to 
nV/ai by 9.12.4.4. □ 

To proceed with our investigation, we divide the space into various pieces 
determined by the tetrahedron T: its interior, the set of x all of whose 
barycentric coordinates are > 0; four frusta, sets where three Ai are > 0 and 
one is < 0; four trihedra, where three Ai are < 0 and one is > 0; and six roofs, 
where two are > 0 and two are < 0. FVustra and trihedra are associated with 
a vertex each; roofs are associated with edges, and matched in pws. If r is 
the radius and u) the center of a sphere tangent to the four faces of T, the 
barycentric coordinates Ai of u) satisfy 


|AiI _ IA2 1 __ IA3I _ IA4I _^ 

fli 0.2 ^3 ^4 


Ai — A2 — A3 — A4 — 1. 


Thus we see there is exactly one sphere in the interior, of radius 3 V/(oi +02 + 
03 + 04); exactly one in each frustum, with radius 3^/(01 + 02 + 03 — 04) for the 
frustum adjacent to the face {x\,X2,xz), for example (this is because 04 must 
be less than Oi + 02 + 03, since 04 is the sum of the projections of the other 
faces onto H4,, and such a projection decreases area, cf. 9.12.4.9). The case 
of the roofs is different, because nothing can be said about Oi + 02 — 03 - 04, 
for example; all we know is that if there is a sphere in one roof, there is none 
in the opposite one. To deal with this, assume Oi > 02 > 03 > 04. If all four 
faces are equal (i.e., have the same area), there are no spheres in the roof; 
the total number of tangent spheres is five. If Oi = 02 > 03 = 04, there are 
six spheres in all, one being inside the roof Ai > 0 , A2 > 0 , A3 < 0 , A4 < 0 . 
If oi > 02 > 03 = 04 and Oi + 04 = 02 + 03, there are seven spheres, two in 
roofs; and, finally, there are eight spheres in all other cases. 

For more details on these spheres, see |COO] or |R-C, volume II, p. 653]. 
Here we just remark that Oi = 02 = 03 = 04 for the regular tetrahedron 
(cf. 12 . 5 . 4 . 1 ), but it is easy to construct non-regular tetraJiedra all of whose 
faces have the same area. 


10.7. Spheres 

In the next sections, we study circles and spheres in their environment, 
and their relationships among themselves. The sphere for its own sake is stud¬ 
ied in chapter 18. For more, see [COOj. We will implicitly assume dimX > 2, 
since in dimension one a sphere is jnst two points and not very interesting. 

10.7.1. Definition. The sphere of center a and radius r is the subset 
^(o, r) = {x €. X \ ax = r} (also denoted by Sx{a,r), is there is danger of 
confusion). If n = dimX = 2, we talk about a circle. 
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10.7.2. Intersection with a subspace. Let S = ^(o.r) c X, and let Y 
be an affine subspace of X. Denote by x the projection of o on F (cf. 9.2.4). 
Then, by 9.2.3, we have (figure 10.7.2): 

{ 0 if d(a, Y) = ax > r; 

{z} if d(a, Y) = ax = r; 

Sy (z, y/r^ — az^) if d(o, F) = oz < r. 





10.7.3. Note. Depending on the nature of the problem, it can be useful to 
exclude spheres of zero radius. For the sake of notational simplicity, we leave 
it to the reader to specify which case is being considered, the task not being 
an arduous one. 

10.7.4. Tangent hyperplanes. The subspace F is said to be tangent to 
S = iS'(o,r) (at the point F n 5') if d(a,y) = r. For each zG 5 there exists a 
unique tangent hyperplane to S at z, denoted by TxS; it is characterized by 
the fact that it contains z and is orthogonal to xa. (It can also be considered 
as the vector subspace z"*", cf. 18.1.2.4.) 
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This definition coincides with the notions of tangent hyperplanes stem¬ 
ming from other theories; for example, if S' is considered as a quadric, its 
tangent hyperplane in the sense of a non-degenerate quadric is the same as 
the one just defined (cf. 14.3.8). Again, if S is considered as a differentiable 
submanifold of X, the tangent space to S at z £ S, defined as the set of 
vectors tangent at x to (7* curves whose image is in X, coincides with our 
tangent hyperplane (cf. 18.3.3). 

10.7.6. Intersection of two spheres. Consider in X two spheres 
S = iS'(o,r) and S' = S'(a',r') with a ^ a' (that is, S and S' are eccentric). 
It follows immediately from 10.2.1 that 

— if aa' < |r — r'\ or aa' > r + r' we have S n S' = 

— if aa' = r + r' OT aa' = |r — r'\ there is a point x such that S' D S" = {z}, 
and ns = nS'; 

— if |r — r'l < aa' < r + r', there exists a hyperplane H, orthogonal to aa', 
such that Sn S'= Sr\H = S'r\H is a. sphere in H, of center {a,a')r\H. 

If aa' = r + r', we say that S and S' are externally tangent; if aa' = |r — r'|, 
they are internally tangent. In the third case we say that S, S' are secant. 



Figure 10.7.5 


10.7.6. Analytic geometry. The analytic geometry of spheres is simple: 
if we vectorialize X, the equation of S = S(o,r) is ||z — o|p = r^, or 

||z|p — 2(a I z) -I- ||o||^ — = 0. 

Conversely, the equation fc||z|p — (a|z)-t-/i = 0, where fc / 0 and h are scalars 
and Q is a vector of X (which we have vectorialized at au arbitrary point) 
represents the empty set, if ||ap — 4kh < 0; the point {—Q/2fc}, if ||q|P — 
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Akh = 0; and the sphere of center {—a/2k} and radius 
otherwise. 

The intersection 5 D 5' of two spheres (cf. 10.7.5) must be contained in 
the set of equation 

||z|p — 2(o I x) + ||o|p — = ||a:|p — 2(o' | x) + ||o'||^ - r'^, 

which turns out to be the hyperplane 2(a' — o 1 1 ) + ||o|p — ||o'|p + — r'^. 

For applications of analytic geometry to spheres, see 9.7.3.7, the whole 
of chapter 20 and [COO]. 


^ NP-4fch y 


10.7.7. Angle betwe en tw o spheres. If 5 = 5(o,r) and S' = S(a',r') 

intersect, the angle ^ = za, z? does not depend on z for z G 5 D S''. We call 
this the angle between S and S'; it is given by 


cos 4> = 


+ r'^ — oo'^ 

2iy ■ 


]i 4> = ir (resp. ^ = 0), S and S' are externally (resp. internally) tangent; if 
(/> = 7r/2, we say that S and S' are orthogonal, and we write S ± S'. 


10.7.8. Proposition. Given n + l spheres Si (t = l,...,n+ 1) in an 
n-dimensional space X and n + 1 angles (p, € |0, x], there exist at most two 
spheres that intersect each S, at an angle <pi (t = 1,..., n + 1). 



Figure 10.7.8 
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Proof. Assume Si = and let S = [xo,R) be the desired sphere; 

with the notations and results of 9.7.3, we have r(zn, 2:1,.. ., Zn+i) = 0 and 
~ rf — 2Rri cos <f>i . Plugging the values of do,- (i = 1,..., n + 1) into 
r(zo, 2 : 1 ,..., Zi 4 . 1 ) and carrying out some row and column subtractions, we 
find a second-degree equation for R, say aR? -I- PR -I- 7 , so there are only two 
possible values for the radius. But knowing R determines doi = x^Xi, so by 

9.7.1 there are at most two possibilities for xq- CH 

10.7.9. Examples. The case of orthogonal spheres is special and has at 
most one solution, for if cos^, is zero for all t, the equation in fZ we obtain 
is of the form aR? -I- -y = 0. We shaU return to this point in 10.7.10.2. 

A search for spheres tangent to n -t- 1 given spheres would yield 2"'^^ 
solutions at most, since there are 2”''‘^ choices of <fi = 0 or ir; but if we 
replace 4>ihy ir — (j>i everywhere, the equation -I- fiR -I- 7 is not changed, 
so there are at most 2"'*'^ solutions. This upper bound actually occurs, and 
each combination of = 0 or tt admits two or zero solutions — see figures 

10.7.9.1 and 10.7.9.2 for the two-dimensional case. But the lower bound zero 
for the number of solutions also occurs! 



Figure 10.7.9.1 


10.7.10. Power of a point with respect to a sphere. Consider 
S = S[a, r) and x € X. For every line D containing x and intersecting 5 at I 

and t' (where we may have t = t'), the inner product [xt \ z?) is a constant, 
equal to xa^ — r^. This number b called the power of x with respect to S, 
and denoted by P^S. 

The absolute value of Pj-.S is xt ■ xt', and its sign is positive or negative 
depending on whether x is inside or outside the segment |l,l']. 
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Figure 10.7.9.2 


The proof is very simple: call h the midpoint of |t, t'], and write 

(xt I i?) = xh^ — ht^ = xa^ — ah^ — ht^ = xa^ — . 

In practice, PxS is obtained by plugging in x into the equation of S (assuming 
the coefficient of || ■ |p is 1). 

10.7.10.1. Let S = 5(a,r) and S' = S(a',r') be eccentric spheres (o ^ a'). 
The set { X G X I P^S = PxS'} is a hyperplane (use 9.7.6.5 or analytic 

geometry), orthogonal to oo^, and called the radical hyperplane (or radical 
axis if n = 2) of 5 and S'. If the two spheres are secant, the radical hyperplane 
is merely the hyperplane containing their intersection; otherwise, it can be 
constructed geometrically as shown in figure 10.7.10.1, which uses the defining 
property of the power of a point. 

10.7.10.2. Orthogonality between spheres (cf. 10.7.7) can be expressed in 

various ways, and is, for this reason, a useful notion. Two spheres S = S(a, r) 
and S' = S(a', r') are orthogonal if and only if aa'^ = + r'^, or PaS' = r^, 

or P,i'S = r'^, or there exists a line D going through a and intersecting S' 
(resp. S) at points t,t' (resp. x,x') such that [x,x',t,t'] = —1 (harmonic 
division). And, sure enough, if and only if T^S ± T^S' for some (or any) 
X G 5 n 5'. 
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Figure 10,7,10.0 



We deduce from this that the center of a sphere orthogonal to two others 
lies in the radical hyperplane of the two; a sphere orthogonal to n + 1 others 
(in an n-dimeiisional space X) has its center in the intersection of all the 
radical planes determined by the others, which is a point, called the radical 
center of the n-f-1 spheres. This gives a simple proof of the example discussed 
in 10.7.9. 

10.7.10.3. The notion of power enables us to solve a number of construction 
problems. For example, finding a circle in the plane containing two points 
and tangent to a given circle (figure 10.7.10.3.1). 

Another example: the theorem of the “sixth circle” demonstrated in 
figure 10.7.10.3.2 (not to be confused with the theorem of six circles of Miguel, 
whose proof is much more involved). For a recent reference on such theorems, 
see [DI, 256]. 
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Figure 10.7.10.3.1 



Figure 10.7.10.3.2 


10.7.11. Polarity. Polaxity with respect to a sphere is a particular case 
of polarity with respect to a non-degenerate quadric (cf. 14.5); it presents no 
special interest, and in fact has the drawback (In the afRne case) of being 
subject to exceptions, caused by the points at infinity. We just mention two 
particularities of this case of polarity: first, the polar hyperplaiie H of x with 
respect to S{a, r) is orthogonal to ax, and determined by 

r2 
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second, polarity with respect to a sphere can be defined in an elementary way 
as follows: two points x, y are conjugate with respect to S' if the sphere of 
diameter [ 2 :,yl is orthogonal to S (this is a consequence of 10.7.10.2). 

See an application in 10.13.14. 


10.8. Inversion 


Our treatment of inversions here will be elementary. In chapter 20 it 
will be expanded in order to avoid exceptional cases and allow the use of 
quadratic forms, a powerful tool. Thus this section can be read for its own 
sake or as motivation for chapter 20. 

10.8.1. Definition. The inversion of pole c and power a G R* is the map 
i = I'c.a : X \ c —> X \ c defined by 


where we have vectorialized X ate. 

10.8.1.1, We always have P = Idx\c. If a < 0, there are no fixed points; 
if a > 0, the set of fixed points is the so-called inversion sphere = 

{ z € I i(a:) = x }, and we say that ic.„ is the inversion through S. Every 
sphere S such that P^S = a is globally invariant under t^.a, as can be seen 
from 10.7.10. Every set of the form Y \c, for Y a subspace of X containing 
c, is also invariant, and is the inversion of pole c and power a in K 

(heredity). 



10 .8.1.2. We have 

*c.a — Hc,ctp-^ lx\c' 

10.8.1.3. For z, y G X \ c, we have 


»(z)t(y) = |a| 


' cx ' cy 


This follows from a trivial calculation in the vectorialization of X at c. 
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10.8.2. Transforms of spheres and hyperplanes. Let i = ic.a 
be an inversion in X. The image f(H) of a hyperplane H not containing 
c is a punctured sphere S \c, where 5 is a sphere of X going through c. 
Conversely, the image of S\c, where 5 is a sphere containing c, is a hyperplane 
not containing c. The image of a sphere not containing c is a sphere not 
containing c. 

To see this, Use an appropriate homothety and 10.8.1.2 to reduce to the 
case a = ch^, where h is the orthogonal projection of c onto H. Putting 
i(x) = x' for X S H and calculating in Xc, we get 

||hf; 


but X 


so that 


e .ff <==> (a:' - h I x') = 0, 


which means that x' belongs to the sphere of diameter |c,/i]. If 5 is a sphere 
not containing c, we get i(5) = S li PfS = a, by 10.8.1.1; the general case can 
be reduced to this one by applying an appropriate homothety and 10.8.1.2. 





10.8.2.1. Notes. For the study of sphere-preserving maps, see 9.5.3.2, 
9.5.3.6 and 18.10.4. 
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10.8.3. Mechanical linkages. There exist mechanical linkages that 
realize inversions; their use lies in transforming a circular movement into a 
rectilinear one, since a circular movement is mechanically very easy to achieve 
(think of a compass or a crank), whereas straight lines are not (in order to 
obtain a straight ruler, one has to rub one ruler against another). The reader 
can check that the mechanisms in figures 10.8.3, where the point c is fixed and 
the others move around, act as inverters. For other mechanical linkages, see 
[LBl, 64-88]. See also 9.14.34.A. It is interesting to notice that the problem 
of finding a linkage yielding straight lines stayed open during the nineteenth 
century for many years. 



Figure 10.8.3 

10.8.4. Conjugation by inversions. Let g be an inversion, and / a 
hyperplane reflection an or an inversion through the sphere S (cf. 10.8.1.1). 
The map gfg~^ is a reflection in the first case, and an inversion through 
g(iS) in the second. 


Figure 10.8.4 

10.8.4.1. Proof, We use the following criterion, a consequence of 10.7.10.2: 

two points X, x' are inverse to each other under the inversion through the 
sphere S, or symmetric to one another with respect to the hyperplane S, if 
and only if every sphere containing x and x' is orthogonal to S (a sphere 
and a hyperplane are orthogonal if the hyperplane contains the center of 
the sphere). This criterion implies 10.8.4 once we know that inversions map 
orthogonal spheres and hyperplanes into orthogonal spheres and hyperplanes; 
this will be proved in 10.8.5.3. □ 

10.8.4.2. Note. The reader will have noticed how cumbersome it is to always 
have to say “sphere or hyperplane”, and also that 10.8.4 is incorrect, in that 
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gfg~^ is not defined everywhere (one must exclude the poles of / and g). For 
these two reasons, we begin to feel the need for a structure unifying spheres 
and hyperplanes, and for being able to compose inversions and hyperplane 
reflections without restriction; this will be one of the aims of chapter 20. 


10.8.5. Inversions and differential geometry. Let i = and 
vectorialize X at c. The map i is a C°° diffeomorphism, and its derivative is 
given by 


10.8.5.1 


‘'( 2 :)(?/) 




2(a: I y) 
IkP 



This follows from the definition of t and diiferential calculus (cf. [CHl]). 

10.8.5.2. Corollary (inversions preserve angles). Let i = ic.a be an 
inversion in an n-dimensional space X. The derivative i'(x) at any point 
X S X \ c is a similarity, preserving orientation if a” < 0, and reversing it 
if a" > 0. In particular, the derivative preserves angles between lines and 
oriented lines, and switches the sign of oriented angles when X is a plane. 

Proof. FVom 10.8.5.1, i'{x) is the composition of the reflection through the 
vector hyperplane x-^ and the homothety of ratio a/|| 2 :||^; the corollary follows 
from 8.8.5. □ 



10.8.5.3. In particular, inversions preserve orthogonality relations between 
spheres and hyperplaiies. This could be shown directly, of course, without 
resorting to differential calculus. 

10.8.5.4. Note. In 9.5.4.6 (Liouville’s theorem) we classified diffeomor- 
phisms whose derivative is everywhere a similarity. 

10.8.5.5. Inversions and osculating circles. Let X be a plane, i an 
inversion of pole c, and / a curve in X. The image t (C) of the osculating 
circle C to / at f(t) is the osculating circle to t o / at i{f(t)) (at least if 
c ^ C). In fact, i preserves circles, and the osculating circle is the limit of 
the circle passing through three points of the curve as they get ever closer. 
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In any case, the result above gives a construction for the center of curva¬ 
ture of the inverse C of a curve C, given the center of curvature of C (figure 
10.8.5.5). 

An interesting corollary of 10,8.5.5 is the following: the torsion of a simple 
closed curve on the sphere vanishes at four points at least (see |B-G, 9.7,9]). 

10.8.5.6. See the interesting article [BA-WHj. 


10.9. Circles in the plane 


In this section and the next X is a plane. 

This, after lines in the plane, is the simplest situation in geometry, and per¬ 
haps the simplest of all, since, as observed in 10.8.3, a compass is more natural 
than a ruler. There are myriads of results, of which we mention only a few 
(see [COO] for more). The essential tools are the power of a poiut, inversions 
and oriented angles (cf. 8.7). In fact, the principal application of oriented 
angles is exactly to such problems, mostly via 10.9.5. 

10.9.1. Notation and convention. Until the end of this chapter we 
denote the line passing through two distinct points x,y by xy = {x,y) . If z 
belongs to a circle C determined by the context, we denote by zz the tangent 
T^C to C at z. 

Our underlying theme will be the cocyclicity of four points z, y, z, t of X. The 
first criterion we give is purely metrical: 

10.9.2. Proposition (Ptolemy’s theorem). If (x{) (i = l,2,3,4) 
are points in the plane, and their distances are denoted by dij = XiXj, the 
following inequality always obtains: 

di2^34 ^ <^i3<^42 + dind2Z- 

If equality holds, the points are on the same circle or line. For four points in 
the plane to be cocyclic or collinear it is necessary and sufficient that one of 
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the three equalities 




be satisfied. 


Proof. We have already shown this in 9.7.3.8, but a more elementary demon¬ 
stration is welcome. Set Xi = x, X 2 = y, 23 = z, X4 = c, and let i be the 
inversion of pole c and power 1. Then the points x' = t ( 2 :), y' = i(y)t = t(^) 
satisfy 


x'y' 


xy 


y'z' 


yz 


cx cy 


cy ■ cz 


cz ■ cx 


Thus 


cz ■ xy + cx ■ yz — cy ■ zx = {cx ■ cy ■ cz){x'y' + y'z' — x'z'), 

and the conclusion follows form 9.1.1.1 and 10.8.2 if we remark that, among 
three collinear points, one is always between the other two. □ 



Figure 10.9.2 



10.9.2.1. Historical note. Ptolemy used this relation to produce a “table 
of chords”; if \x,z\ is a diameter of the circle, the formula xz ■ yt = xy ■ tz + 
xt ■ yz can be rephrased as the trigonometric formula sin (a -f 6 ) = sin a cos b -I- 
sin6cosa (cf. 8.7.8.1 and 10.3.2). See also 9.7.3.8. 

10.9.2.2. The most useful condition for cocyclicity involves oriented angles. 
We state it under the conventions established in 8.7 and 10.9.1; thus the 
statements below are still valid even if some of the points are not distinct. We 
give the proofs in the general case; the reader can take care of the particular 
cases, using the fact that a tangent is orthogonal to the radius. 

10.9.3. Proposition. Let C be a circle of center uj, and a,b two points on 
C. For every x S C we have (cf. 8.7.7.7) 
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Proof. Let y be the point diametrically opposed to x; by 8.7.2.4 (iv), it is 
enough to verify that 

oJa, ujy = 2xa, xy. 

Let D be the perpendicular bisector of \x, a], and consider the reflection od- 
By 8.7.2.4 (v), 


wa, uy = ijia, xa + xa,u)y = xa, xu) + xa, xu) = 2xa, xuj. 


□ 


10.9.4. Corollary. Let a,b € X be distinct points. For any a 6 Ji{X) \0, 
the set 

{ X e AT I xa, xiU = a } 
is a circle containing a and b. 

Proof. Given a and b, one easily constructs a point uj such that tva = u)b and 

tL)a,wb = 2a] for example, u) is the intersection of the perpendicular bisector 
of [a, 6] and the line D containing a such that ab,D = 6 ~ a (cf. 8.7.7.4). 

The circle C of center ui and passing through a and b is such that xa,xb = a; 
conversely, if the two angles are equal, the center uj' of the circle containing 

a, b and x satisfies u)'a,u)'b = 2a, and can be no other than ai. □ 



10.9.5. Corollary. Four distinct points a,b,c,d in X are cocyclic if and 
only if ca, cb = da, db. 

Proof. This is obvious if a,b,c are collinear (see 8.7.7.3, for example). Oth¬ 
erwise ca, cb ^ 0 and we apply the two previous results. □ 

10.9.6. Remarks. The condition in corollary 10. 9.5 is fals e for ii on- orient ed 
angles. What is true is that cocyclicity implies ca, cb = da, db or ca, cb -f 
da, db = TT, but the converse does not hold. Thus 10.9.5 provides a significant 





advantage, especially in the case of complicated figures, where the number of 
cases to consider would be large. See also 10.13.15. 

Notice the special case a = S, which yields the circle with diameter [a, 6 ], 

We remark that 10.9.3 and 10.9.5 will be obtained as particular cases of a 
general theorem on conic sections in 17.4.2. 

10.9.7. Examples. Corollary 10.9.5 has numerous applications. The reader 
can tackle exercises 9.14.3 and 10.13.18 and leaf through the exercises in [D- 
Cl], [D-C2|, (I-R], [R-C], [HD]. Here we discuss only two applications, start¬ 
ing with the Simson Hue, already mentioned in 10.4.5.4 (see also 9.14.34.3.D, 
10.4.5.5, 10.9.7.10, 10.11.3, 10.13.27, 17.4.3.5, 17.8.3.2). 

10.9.7.1. Let {a, 6 ,c} be a triangle. In order that the projections p,g,r of a 
point X on the sides be colliiiear it is necessary and sufficient that x belong 
to the circle circumscribed around {o, 6 ,c}. 

For this kind of proof, one can often just look at the figure and read out 
chains of equal angles. Here we work as follows: since pc,px = i ^ and 

Ja,'^ = S = ra,rx, we get, applying 10.9.5 four times, cb, cx = cp,cx = ^ 

and ab, ax = or, ox = ^, But the condition “p, q, r are collinear” is 
equivalent to ^ so also to cb,cx = ab, ax, and this is equivalent 

to ‘‘a,b,c,x are cocycUc” by 10.9.5. 

This theorem was attributed to Simson for a long time, but it is actually 
due to Wallace. 

10.9.7.2. Miguel’s six circle theorem. Let Ci (i — 1,2,3,4) be circles 
such that Cl DC^ = {a n o'}, C 2 O C 3 = {& n fc'}, C 3 fl C 4 = {c n c'} and 
C 4 n Cl = {dnd']. Then a,b,c,d are cocyclic if and only if a',b',c',d! are. 
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Figure 10.9.7.1 



Figure 10.9.7.2 


Proof. Using Chasles’s relation and 10.9.5 (four times), we get 

ba, be = ba, bb' + bb',bc = a'a, a'b' + c'b', e'e, 
da,dc = da,dd' + dd',dc = a'a,a'd' + c'd',c'c. 
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Subtracting and applying Chasles again, 

ba,bc — da,d.c = a'd',a'b' — c'd',c'b', 
and the conclusion follows form 10.9.5. 

10.9.8. Note. Miguel’s theorem is the first in a “chciin of theorems”, 
cf. 10.13.19. On chains of theorems see [PE, 431] and [CRl, 262, 258]. See 
also 10.11.7. 

10.10. Pencils of circles 


The following facts are simple consequences of the discussion in 10.7.10. 


10.10.1. Definition and proposition. Let C,C' be non-concentric 
circles. Any two elements of the set J = { circles T | T JL C and FTC'} 
have the same radical axis, the line determined by the centers u),w' 

of C,C'. The set 7 is called a pencil of circles. If F,?' belong to a pencil 7 , 
the pencil formed by circles O such that C ± F and C ± F' is denoted by 7 -^ 
and called the pencil orthogonal to 7 ] in this case C ± F for every C 6 7 ‘*‘ 
and F 6 7 , and we have 7 ''“'‘ = 7 . Let C and C" be two distinct circles 
in a pencil 7 , If C and C' are tangent, 7 consists of circles tangent to one 
another, and so does 7 ' (figure 10,10.1.1). If C n C' = 0 , there exist points x 
and y (on {w,w')) such that 7 -^ = {circles C \x,y C) (figure 10.10.1.2); 
we call X and y the limit points of 7 . If CoC' contains more than one point, 
the elements of 7 "^ are pairwise disjoint (figure 10.10.1.3). □ 


More need not be said, if only because we will discuss pencils of circles at 



length in chapter 20. 


Figure 10.10.1.1 
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Figure 10.10.1.3 
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10.10.2. Reduction of two circles. One use for lO.lO.l and the 
inversion of circles is afforded by the following fact; two circles C, C can 
always be transformed by an inversion into either two concentric circles or 
two lines. 

If Cr\C' ^ 0, make one of the intersection points the pole of the inversion; 
if CnC' = 0, choose instead a limit point, as defined above. Since inversions 
preserve orthogonality (cf. 10.8.5.3), the image of the second limit point under 
the inversion will be such that all lines going through it are orthogonal to the 
image circles; this can only happen if that point is the center of both image 
circles. 



Figure 10.10.2 

Observation 10.10.2 solves a number of problems about circles, for ex¬ 
ample, the study of circles intersecting two given circles at the same angle, 
or the construction of a circle intersecting three given circles at given angles. 
All this is possible because the conditions in question are preserved under 
inversion. Another, deeper way of phrasing this remark is by saying that the 
set of two concentric circles has a big stabilizer G, containing all bometries 
that leave invariant the common center of the circles; similarly, two intersect¬ 
ing lilies are left invariant by any homothety centered at their intersection 
point. Conjugated by an inversion, the group G becomes a group G that 
stabilizes the two circles originally considered, and G is made up of maps 
which preserve circles, lines and angles. (They are not really maps from E 
into itself, but from E into itself, E being introduced in chapter 20 exactly 
to make precise the kind of argument we have been presenting.) 

Now an example of an application of 10.10.2. 

10.10.S. Steiner’s alternative. Let C and C be two circles, with C 
inside C, and let Fi be a circle externally tangent toC and interiially tangent 
to C. Construct a chain of circles F, (i = 1,2,...) as follows: for each i, let 
Fi+j be tangent to F^, C and C, and distinct from Fj-i. If we have F,- ^ F 
for every i > 1, too bad; but if F„ = F^ for some n, then for any other initial 
circle F„ we’ll have FJ, = Fj. 

The proof is trivial if we transform C,C' into two concentric circles, for 
a pair of concentric circles is invariant under the group of rotations around 
their common center. 
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Figure 10.10.3 


The alternative is that either no chain closes up, or they all do. Here is 
another example of an alternative: 

10.10.4. The great Poncelet theorem for circles. As in 10.10.3, 
we start from two circles C, C with C inside, and we construct a sequence of 
points Xi (t > 1), beginning with a given Xi € C, by the following condition: 

is tangent to C for every t, and (zj,z,+i) ^ (z,-,z,_i). Then it 
can be shown that either there exists n such that x„ = Xi for all Xi £ C, or 
Xi ^ xi for all Xi e C and i > 1. 

This result is much harder to prove than Steiner’s alternative, in spite 
of the simplicity of its statement (notice that straight lines are not preserved 
under inversions, so the previous reasoning does not apply). We shall see a 
proof of it in 16.6. See also 10.13.3, 17.6.5, and |B-H-H]. 



Figure 10.10.4 


10.10. Pencils of circles 


317 


10.11. Classical problems 

We collect here some classical results about circles. The reader can try 
his hand at proving them or consult the references, especially [COO], an 
excellent and systematic book. 

10.11.1. APPOLONIUS’ problem. The question is to find a circle F tangent 
to three given circles C,C',C". By 10.7.9 we know there are at most eight 
solutions. 

An elementary method consists in adding or subtracting from the radii 
of the three circles the radius of the smallest one, thus reducing the problem 
to finding a circle containing a given point and tangent to two given circles 
(figure 10.11.1.1); this can be done by using 10.10.2 (see 9.6.6 if necessary). 



Figure 10.11.1 2 
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A much more elegant method, due to Gergonne, gives the contact points 
with C, C',C" directly. In figure 10 . 11 . 1 . 2 , the point w is the radical center 
(cf. 10.7.10.2) of C,C',C", that is, the point having the same power with 
respect to each of the three circles. This can be easily found using 10.7.10.1. 
The line D is one of the four lines connecting the six centers of the homotheties 
taking C into C", C into C" or C" into C (the six centers do fall into four 
lines). Finally, 7 , 7 ' and 7 " are the poles of D with respect to C, C and C" 
(cf. 10.7.11). The desired contact points are given by the intersection of C 
(resp. C, C") with the line (resp. ( 01 , 7 '), {ui,')")). 

See [CRS] for a recent reference. 

10.11.2. Problem of Napoleon-Mascheroni. Using a compass only 
(cf. 10.8.3), find the center of a given circle. See |LBl, 25 ff.], more pre¬ 
cisely, the historical note at the bottom of page 25 and the statement of the 
theorem of Mohr-Mascheroni on page 26: “Every construction possible with 
a straightedge and compass is possible with the compass alone.” A good 
portion of [LBl] reads like a novel. 

10.11.3. The nine-point circle and the Feuerbach theorem. 

Let T = {x,y,z} be a triangle, p,q,r the feet of the altitudes, u,v,w the 
midpoints of [z,/i], [t/,/i], where h is the intersection of the altitudes 

(cf. 10.2.5), w the center of the circle circumscribed around T and g its cehter 
of mass. Prove that the nine points u,v,w,l,m,n,p,q,T all lie on the same 
circle P whose center O is the midpoint of h and w. Show also that g is on the 

line {h,0,w) and that gh = —2gu. The circle P is tangent to the inscribed 
and exinscribed circles (cf. 10.1.5 and figure 10.1.5). 


X 



Figure 10.11.3 
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The proof utilizes the inversion of pole u and power up us, where s is 
the foot of the interior bisector stemming from x. Notice also that F is the 
circle tritangent to the three-cusped hypocycloid found as the envelope of the 
Simson lines of T: cf. 10.4.5.5, 10.9.7.1 and 9.14.34. 

10.11.4. CaSTILLON’S problem. Given a circle C and points .n 

not on C, find a polygon (z,),= i.„ such that x, G (z,,Zi^i) for all i = 

l,...,n. One rather complicated solution consists in taking the product 
of the inversions fi of pole z,- that leave C globally invariant; then (/„ o 
/„_! o •. ■ o /2 o fi){zi) = Zi, and we just have to find the fixed points of 
fn ° fn-i ° ° h ° fi- A better method, which holds for all conics, will be 

given in 16.3.10.3. See also [DO, 144]. 



10.11.5. MaLFATTI’s problem. Given a triangle T, find three circles 
C,C',C" that are each tangent to two sides of T and to the other two cir¬ 
cles. For the solution, see [HD, volume 1, 310], [R-C, volume 1, p. 311-314], 
[DO, 147] or [COO]. 



10.11.6. LaguERRE’S circles. This deals with oriented circles and lines, 
tangent among themselves with the right orientation; the result would be 
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false if matching orientation were not required. A complete exposition can be 
found in [PE, 426]. For an algebraic formulation of this theory, see [BZ, 251 
ff.]. See also [BLA3, chapter 4|. 

10.11.7. The seven circle theorem. For this very nice result, see 

[E-M-Tj. 

10.12. Parataxis; prelude to sections 18.9, 20.5 and 
20.7 

This section discusses a number of surprising phenoiiiena relating to cir¬ 
cles and spheres in ordinary three-space. They will be explained in two ways: 
geometrically in 18.9, and algebraically in 20.5.4. 


10.12. Parataxis: prelude to sections 18.9, 20.5 and 20.7 
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10.12.1. Let r be a torus, that is, the surface or revolution created when a 
circle rotates around a line disjoint from and coplanar with it. A torus con¬ 
tains two families of circles, the paralleb of latitude and the meridians. The 
first surprising fact is that T contains other circles as well. These are some¬ 
times known to mathematicians as Villarceau circles, but they were known 
much before he was around (1848), as attested by the I’Oeuvre Notre-Dame 
museum in Strasbourg, where the spiral staircase is topped with a torus 
sculpted in such a way that its live edges are exactly these circles (figure 
10.12.1.2). Villarceau circles are found by intersecting T with a bitangent 
plane (figure 10.12.1.3). 



Figure 10 12.1.1 Figure 10.12.1.3 



10.12.2. These exotic circles are divided into two fajiiilies {C(t)} and {?(!)}. 
Two ciixles C(t) and r(0) from different families intersect in exactly two 
points. Two circles from the same family not only do not intersect, but 
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Figure 10.12.1.2 (Source: (HG)) 
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they axe always linked (for the definition of linkedness, see [B-G, 7.4.8], for 
example), as demonstrated in the Strasbourg torus. 

10.12.3. Villarceau circles satisfy strong angular properties. First, they are 
helices of the torus, that is, they intersect each meridian at a constant angle. 
What is more, two different circles C(t) and C(t') (or r(t) and r(t')) form 
a paratactJc annuius, that is, every sphere S containing C(t) intersects C(t') 
at a constant angle a, and every sphere S' containing C'(t) intersects C{t) 
at a constant angle a. 



10.13. Exercises. 

* 10.13.1. The altitudes of a triangle are concurrent. 

10.13.1.1. Deduce this fact from the following property (show it); for every 
four points {a,b,c,d} in a Euclidean plane, we have 

(ab \ cd) + (ac | db) + [ad | &c) = 0. 

10.13.1.2. Deduce it uow from the Theorem of Ceva (2.8.1). 

10.13.1.3. Deduce it from 16.5.4 by using cyclic points. 

10.13.2. More formulas. Show that the following relations hold for a 
triangle; 

= rr„r,,rc, 4fZ = r„ + r,, + - r, ^ 

2Rr ab be ca 

1111111 , 

- --h — H-— r-1'7- + 77i P = '"uD. + 

r r„ r(, r,. /i„ hi, hr 

a + fc + c 

sin A + siu B + sin O 


2R = 
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* 10.13.3. THE GREAT PONCELET THEOREM FOR CIRCLES (n = 3,4). 
Given a triangle T, show that the radius R of the circumscribed circle C, the 
radius r of the inscribed circle F and the distance d between the two centers 
satisfy the relation R? — 2Rr — Show that if, conversely, two circles C 
and r of radii R and r whose centers are d units apart satisfy the condition 
R^ — 2Rr = d?, then for any x G C there is a triangle T inscribed in C and 
circumscribed around F, and having x as a vertex. Deduce from this fact the 
theorem of Poncelet for the case of triangles (16.10.4). Make a similar study 
for a quadrilateral inscribed in C and circumscribed around F (see [COO]). 



10.13.4. MORLEY’S theorem (cf. 10.3.10). First assume the theorem to be 
true, and show that, in the figure, the triangles {u, r, 9 }, {u, r, p} and {lu, p, g} 
are isosceles; find their angles as a function of the angles of {x,y,z}. Deduce 
from this a proof of Morley’s theorem, starting from an equilateral triangle 
{r,p,q} and constructing u,v,w and finally x,y,z with the desired angles, 
thus obtaining a triangle similar to the original one. 


X 



Figure 10.13.4 
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10.13.5. Show that, for every t in the plane and every triangle T = {x,y,z}, 
we have tx + ty + tz > 1\JyfZS, and that equality holds if and only if T is 
equilateral. Deduce that tx + ty + tz > 6 r. 

10.13.6. Fill in the details of the discussion in 10.4.5.4. 

10.13.7. Brahmagupta’S formula. For a quadrilateral inscribed in a 
circle, calculate the diagonals as a function of the sides a, b, c, d. Show that 
the area of the quadrilateral is 

V{P - «)(P - ^)(P - c)(p - d), 

where p = (o + 6 + c + d)/2. Show that a formula like 

area = P^n(l-^) , 

generalizing Heron’s formula (10.3.3) and Brahmagupta’s formula to the case 
of arbitrary polygons incribed in a circle, is false when they have more than 
four sides. See [LEV]. 

* 10.13.8. Given a convex quadrilateral, find the minimum of the function 
“sum of the distances from a variable point to the vertices of the quadrilat¬ 
eral”. Do the same for a convex hexagon circumscribed around an ellipse 
(cf. 16.2.13). 

10.13.9. Calculate the radius of a sphere circumscribed around a regular 
tetrahedron. 

10.13.10. Calculate the volume of a tetrahedron as a function of; 

i) two faces, the common edge and the dihedral angle between the two; 

ii) the four faces, two opposite dihedral angles and the corresponding edges; 

iii) one face and the adjacent dihedral angles. 

10.13.11. Show that if the four faces of a tetrahedron have the same area, 
they are congruent. 

10.13.12. In a tetrahedron, call a,/?, 7 the distances between pairs of oppo¬ 
site edges (cf. 9.2.5), and hi (t = 1,2,3,4) the distances between a vertex and 
the opposite face. Then 

J_ J_ ^ J_ 

^ + + 

i ’ 

10.13.13. Ill a tetrahedron, the product of the sines of two opposite dihedral 
angles is proportional to the product of the corresponding edges. 

10.13.14. Salmon’s principle. Consider in a Euclidean plane a circle 
C centered at o, two points x, y and their polars , D,, with respect to C. 
Show that 

ax d[x,Dy) 

ay d(y,Dx) 
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Show that if P is a 2 n-sided polygon inscribed in C and a is a point on 
C, the product of the distances from z to the even-numbered sides of P is 
equal to the product of the distances to the odd-numbered sides. If P is 
circumscribed around C, show that the product of the distances from even- 
numbered vertices to a fixed tangent to C is proportional to the product 
of distances from odd-numbered vertices, the ratio not depending on the 
tangent. 

10.13.15. Given two distinct points a and f) in a Euclidean plane and an angle 
Q, oriented or not, between lines or half-lines, study the sets { x\xa, xb = Qi}, 
{ X I xa, xb = Q! }, { z I xa, z6 = a }, { z j zo, z6 = cr }. 

10.13.16. Line of the images. Let T be a triangle and D a line. Show 
that the reflections of D through each side have a point in common if and 
only if D goes through the intersection of the altitudes of T. What happens 
when D rotates around this intersection point? (See also 17.6.2.2.) 

10.13.17. Give an algebraic proof of 10.9.7.2 using cross-ratios (cf. 9.6.5.2). 

10.13.18. The pivot. Consider a fixed triangle {z, t/, 2 } in the plane. 
Show that, given three points p, q, r on the sides of this triangle and a point 
TT in the plane, {z,q,ir,p} are cocyclic if {x,r,n,q} and {t/, p, 5 r,r} are. Now 
assume tt, called the pivot, is fixed, and that p, q, r slide along the sides so 
that the cocyclicity conditions are satisfied. Show that the various positions 
of {p, 9 , r} are all similar (under orientation-preserving similarities). Show 
that any point linked to {p, 9 , r} by an orientation-preserving similarity (for 
example, the intersection of the altitudes, or of the bisectors, etc.) describes 
a line, and every line linked to {p, 9 , r} by an orientation-preserving similarity 
has as its envelope a parabola of center jr (cf. 9.6.7). 


X 



Figure 10.13.18 
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* 10.13.19. A CHAIN OF THEOREMS. In this problem the objects axe sup¬ 
posed to be “in general position”. Let (Di)i = 1,2,3,4 be four lines in a 
Euclidean plane, and let Ci be the circle circumscribed around the triangle 
formed by the three lines Dj (j ^ t); show that the four circles C, have a 
point in common. (See another explanation in 17.4.3.5.) Draw an illustrative 
picture. 

Let (D,),= i. 5 be five lines in a Euclidean plane, and let p, be the point 

associated in the preceding theorem with the four lines Dy (j ^ t); show that 
the five points pi belong to the same circle. Draw a picture. 

State and prove a chain of theorems whose two first elements are the 
results above. On the subject of theorem chains, see [PE, 431, exercise 94.7] 
and [CRl, 258]. 

10.13.20. ISOGONAL CIRCLES. A circle P is said to be isogonal with respect 
to two circles C, C' if it makes the same angle with the two. Show that the 
circles isogonal with respect to two fixed secant circles are all orthogonal to 
the same circle or line. Analyze the situation when the two circles are not 
secant. Replace isogonality by the condition that the angles of F with C, C 
have sum if. Study circles isogonal with respect to three and then four fixed 
circles. Can your results be used in problem 10.11.1? 

10.13.21. Let C,C' be two circles as in 10.3.10, with radii R,R' and centers 
a, a' such that aa' = d. Show that if C and C admit a chain of n circles as 
in 10.10.2, we have (R — r)^ - d? — 41Zrtan^ (x/n). 

10.13.22. Prove that the bitangent plane in figure 10.12.1.3 intersects the 
torus along two circles, and that these circles are helices for the torus. 

10.13.23. Using the technique of 10.3.10, show how the intersection points 
of the internal and external trisectors of a triangle fall into 27 lines. See 
9.14.34.5 and [LBl, 173-208]. 

10.13.24. Given a plane triangle T, study the circles with respect to which 
T is self-polar, that is, its vertices are pairwise conjugate with respect to the 
circle. Same question in dimension three. 

10.13.25. Consider three pairwise tangent circles, with radii a, fc, c, and 

set Q = a~^, P = 6“^, 7 = c“^. Show that the radii of the two circles 
tangent to these three are (a + P + y -h 2y/Py + ya + aP)~^ and [a -|- -|- 

y - 2-\/Py + ya + What does the sign of a -t- ^ -f- 7 — 2y/Py + ya + aP 

signify? 

10.13.26. Let a,b,c be the lengths of the sides of a triangle T in a Euclid¬ 
ean plane. For every integer n, consider the point A’(n) whose barycentric 
coordinates with respect to T are 

I' a’‘ _ ^ _ c" 

\o" -I- 6" -f- c" ’ a" -i- 6" -1- c" ’ o" -t- fc" -t- c" 

Find a geometric characterization for A'(O), X(l), X(2). What does X(n) 
become when n approaches -f-oo (resp. — oo)? 
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10.13.27. The SimSON line strikes again. Let D,D' be distinct lines 
in a Euclidean plane intersecting at a, and take x ^ Du D'. Show that the 
map D B m >-* m' €: D', obtained by intersecting D and D' at m and m' by 
a circle of variable radius containing a and x, is the restriction to of an 
orientation-preserving similarity of center x. What line does the orthogonal 
projection h of x onto mm' describe? Deduce theorem 10.9.7.1. 



* 10.13.28. Given four points a,b,c,dm a Euclidean plane, construct a square 
ABCD such that a € AB, b € BC, c € CD and d £ DA. 

More generally, given two quadrilaterals Q and Q' in the plane, construct 
a quadrilateral Q" similar to Q and inscribed in Q' (resp. circumscribed 
around <?'). 

A 



Figure 10.13.28 
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* 10.13.29. Ford circles. Consider three pairwise tangent circles, and also 
tangent to a line D, in the situation shown in figure 10.13.29.1. If r and s are 
the radii of 7 and 6, and p is the radius of the small circle, find p, ax and bx 
as functions of ab, r, and s, where a, b, and x are the tangency points of 7 , S 
and the small circle with D. 



Figure 10.13.29.1 


Starting now from two circles whose equations, in an orthonormal frame, 
are x^+y^—y = 0 and x"^ +y'^ — 2x—y+l = 0 , use induction to construct circles 
as shown in figure 10.13.29.2. Show that the tangency points of these circles 
with the x-axis always have rational abscissas. Are all rational numbers in 
[ 0 , 1 ] obtained in this way? 



Figure 10.13.29.2 
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Ford circles have found a nice application in arithmetic—one performs 
complex integrations along them ([RA, 267]). 

10.13.30. Nagel’s point. Prove, in at least two different ways, that the 
three lines joining the vertices of a triangle with the point of contact of the 
inscribed circle with the opposite sides are concurrent. 

10.13.31. The butterfly. A quadrilateral ABCD is inscribed in a circle 
of center O as in figure 10.13.31. Through the point P = ABOCD one draws 
a line orthogonal to OP. This line meets AD in M and BC in N. Show that 
PM = PN. 


A 



FTgure 10.13.31 


10.13.32. A quadrilateral with given sides has maximum area when it is 
inscribed in a circle. 

10.13.33. A quadrilateral is Inscribed in a circle. Show that the four ortho¬ 
centers of the four triangles determined by this quadrilateral are located on 
a circle having the same radius as the initial circle. 



Chapter 11 

Convex sets 


Convex sets arise naturally in geometry, but they also play 
a fundamental role in analysis, and even in arithmetic and dif¬ 
ferential geometry. Here we discuss only the geometric aspect 
of the notion of convexity, apart from a few words about convex 
functions. Convex polyhedra, a particular class of convex sets, 
will be discussed in detail in the next chapter. 

Section 11.1 presents some non-trivial constructions for con¬ 
vex sets, to be used later: the Minkowski sum, the polar convex 
body of a convex set with respect to a sphere, and the Stein sym- 
metrization of a convex set. In section 11.3 we classify convex 
sets and their boundaries up to homeomorphisms: the algebraic 
topology of convex sets is very simple. (When it’s so easy to 
classify one’s objects of study, one may as well make the best 
of it...) 

Section 11.4 is classical and includes separation theorems 
derived from the Hahn-Banach theorem. Such theorems are 
used, among other things, to show that polarity between convex 
sets is a good duality relation, and, in section 11.6, to study the 
boundary points of a convex set. Boundary points and duality 
play an essential role in chapter 12. 

In section 11.7 we offer Helly’s theorem, a spectacular re¬ 
sult with a very simple statement. There follows Krasnosel’skii’s 
theorem, a pretty application of Helly’s theorem. Finally, we de¬ 
fine convex functions and give some of their properties in section 
11.8. Two of their applications are used later in the book: the 
theorem of Brunn-Minkowski in the proof of the isoperimetric 
inequality, and the theorem of Loewner-Behrend in a character¬ 
ization of affine quadrics. 
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This chapter only deals with real afRne spaces of finite dimension d. 


11.1. Definition and examples 

11.1.1. Definition. A subset S of an affine space X is called convex if, for 
any x,y G S, we have [z, t/] C S, where \x, t/] = { Az + (l-A)t/ I Ag [0,1]} 
(cf. 3.4.3). 

11.1.2. Examples 

11.1.2.1. Refer to the figures below; figure 11.1.2.1.1 represents a strip in the 
plane, 11.1.2.1.3 an infinite cylinder in space, and figures 11.1.2.1.4, 11.1.2.1.5 
and 11.1.2.1.6 represent the regions determined by a hyperbola, an ellipse and 
a parabola, respectively (cf. 17.1.4). FNirthermore, in figures 11.1.2.1.7 and 
11 .1.2.1.8, the thick parts of the frontier belong to the set, and the thin parts 
don’t. Among these sets, the only ones that are not convex are 11.1.2.1.2 and 
11.1.2.1.7. 



Figure 11.1.2.1.3 Figure 11.1.2.1.4 


11.1.2.2. The following are convex sets; X itself, any affine subspace of X 
(cf. 2.4), and in particular points, lines and hyperplanes. The empty set is 
convex, but, as one surmises, it gives rise to all kinds of false statements; we 
thus enact the following bailing-out maxim (cf. [VE, 198|); “If a theorem is 
false when the set A is empty, we tacitly assume A is non-empty. We sincerely 
hope never to have to resort to thb S.O.S.” 
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Figure 11.1.2.1.5 



Figure 11.1.2.1.7 



Figure 11.1.2.1.9 



Figure 11.1.2.1.6 



Figure 11.1.2.1.8 



11.1.2.3. The convex sets of R (or of X, when d = l) are the intervals (of 
all sorts). 

11.1.2.4. A subset E of X is called star-shaped (at x €: E) if [z, j/] C E 
for every y €: E- Thus a convex set is star-shaped at any of its points. See 
11.7.7 for a very nice characterization of star-shaped sets. Star-shaped, and 
consequently convex sets, are connected and path-connected. 

11.1.2.5. In a Euclidean affine space, all open and closed balls B(a, r) and 
U[a,r) are convex. In fact (cf. figure 11.1.2.1.8), for any subset A of the 
sphere S(a, r), the difference set B(a,r) \ A is convex. This is false for cubes 
(cf. figure 11.1.2.1.7). 
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11.1.2.6. Let X, Y be affine spaces, S d X and T C Y convex sets, and 
f : X —* Y an affine map. Then f(S) C Y and f~^(T) C X are again 
convex (apply 3.5.1). In particular, if C X is a hyperplane, the (open and 
closed) half-spaces determined by it are convex: just take / to be an affine 
form on X such that H = /“*( 0 ) (cf. 2.7.3), so the half-spaces are inverse 
images of intervals (like [ 0 , oo[, for instance), and intervals are convex in R 
(cf. 11.1.2.3). 

11.1.2.7. Every (finite or infinite) intersection of convex sets is convex. Every 
intersection of half-spaces (cf. 11.1.2.6) is convex. Every convex polyhedron (a 
finite intersection of half-spaces) is convex. Convex polyhedra will be studied 
in chapter 12. 



An increasing union of convex sets is convex. 

11 . 1 . 2 . 8 . Positive definite quadratic forms over a vector space E form a 
convex subset of P 2 (E). This example will be met again in 11.8.9. 

We now give three somewhat more intricate examples of convex sets, obtained 
through different procedures: 

11.1.3. The Minkowski sum. The following operation is due to Min¬ 
kowski: If S and T are convex sets in a vector space X, the set 

11.1.3.1 XS + nT = {Xs + fit\s€S,teT} 

is convex, for any two real numbers X,fi. 

The reader can prove this directly, or use 11.1.2.6 with f : X x X 3 {x,y) 

Xx fiy d X. 
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If X is an affine space, this definition only makes sense for A + /x = 1 
(cf. 3.4.1). Otherwise we can only define XS + fiT up to translations, or 
consider it as a “shape” (cf. 2.1.8). The reader is encouraged to draw the set 
XS + fj,T, and especially S — T, for sets S and T to his taste. A particular 
case of the Minkowski sum will be essential in the proof of the isoperimetric 
inequality (cf. 12 . 10 . 10 ). 

11.1.3.2. Proposition. Let X be a Euclidean vector space, A a subset of X 
and e > 0. The sets U{A, e) and B(A, e) (cf. 0.3) satisfy U[A, e) = A+17(0, e) 
for A arbitrary and B(A, c) = A + 5(0, c) for A compact. 



Proof. If d(x, A) < < there exists a point a 6 A such that d(z, a) < e; thus 
z = o + ox e A + y(0, e). Conversely, if z e A + 1^(0, e), we have z — a + az, 
where ||az|l < e, so that d(z. A) < d{x,a) < e. In the compact case, we know 
that, for any x € X, there exists y € A such that d(x,y) = d(x. A). □ 

11.1.3.3. Corollary. If A is convex, so is U(A,c) for every e > 0. If A is 
compact and convex, so is 5(A,e). □ 

11.1.4. Stein SYMMETRIZATION of a compact set. With the definitions 
and notations from section 9.13, we have: 

11.1.4.1. Proposition. If S is convex, so is st^(5). 

Proof. Let z, z' G stff(S), let D and Lf be the perpendiculars to H through 
z and z', respectively, and let [u,v] = stH(S) D D, [u',t;'] = stjj(S) D D'. 
By construction and the convexity of S, the segments [u, u] and [u', n') derive 
from segments |a,6] aud |a',&'] of S. Consider, in the affine plane determined 
by I) and D', the affine map / such that /(a) = u, f(b) = v, f(a') = u' 
and f{b') = v'. This map preserves the length of segments perpendicular 
to H', thus, if T denotes the trapezoid with vertices a,b,a',b', the trapezoid 



336 


Chapter 11. Convex sets 


with vertices u,v,u',v' will be f(T). But T <Z S because S is convex, so 
f(T) C st//(5), and, in particular, [a:,!/] C stj/(iS'). □ 



Figure 11.1.4 


11.1.5. Polar body of a convex set. Duality 

11.1.5.1. Definition. Let A be an arbitrary subset of a Euclidean vector 
space X■ The polar body of A is the set 

i4* = { «/ G X ) (x I y) < 1 for any x £ A}. 



Figure 11.1.5 




11.1. Definition and examples 


337 


By 11.1.2.7, A* is convex for every A. Definition 11.1.5.1 is intimately 
connected with the polarity transformation with respect to the unit sphere 
5 = 5(0,1) of X; in fact, the polar hyperplane of z G X is exactly the 
set {y € X I (a: I «/) = 1 }, cf. 10.7.11 and 15.5. The reader is encouraged 
to draw various sets A and find the corresponding polar bodies A*, as in 
figure 11.1.5. We shall see in 11.4.8 that, if A ranges over convex compact 
sets whose interior contains the origin, the map A » j 4* is a duality with 
excellent properties, which will be abundantly used in the study of polyhedra 
(chapter 12). 

11.1.6. Convex cones. This is an important notion: a convex set C is 

called a convex cone with vertex z if C is invariant under all homotheties 
•ffi.A of center z and ratio We won’t have the time to study this notion, 

but see, for example, [BI3, 46| or [VE|. 

11.1.7. Convexity criterion. We first mention a simple but useful 
property of convex sets; 

11.1.7.1. Proposition. Let S be a convex set in a Euclidean affine space 
X, and take x G X . There exists at most one point y €: S such that d[x, y) = 
d(z,5) (it is obvious that there exists at least one if 5 is closed and non¬ 
empty). 

Proof. This is a consequence of 9.2.2 and the following lemma: 

11.1.7.2. Lemma. Let S be a convex set in X, and let x € X and y £ S 
be such that x ^ y and d(x,S) — d{x,y). If we denote by H the hyperplane 
containing y and orthogonal to xy, the closed half-space determined by H and 
not containing x contains the whole set S. This is still true if S is merely 
star-shaped at y. 

Proof. By contradiction: let z G X be a point not belonging to that half¬ 
space. The angle yx, ^ is acute, and there exist on \y,z\ points t such that 
d(z,t) < d(z, y). □ 



Figure 11.1.7 
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The so-called Motzkin theorem states that this property characterizes convex 
sets; 

11.1.7.3. Theorem. Let S be a non-empty closed set in a Euclidean affine 
space X. Assume that, for any x 6 X, there exists a unique y €: S such that 
d(x,y) = d(x,S). Then S is convex. 

For a proof, see |V£, 94j. In the same book there is a discussion of the points 
of 5 whose distance to z is maximal (p. 98 ff.). 

11.1.7.4. A second characterization of convex sets shall be given in 11.5.4, 
using the notion of supporting hyperplane. 

Again in |V£j there is a characterization of convex sets by various “local 
convexity” conditions (p. 48 ff.}; this is an example of passing from local to 
global properties. For more examples of this philosophy, see 9.5.4.5, 12 . 8 , 

16.4. 18.3.8.6. 

11.1.8. Convex hulls 

11.1.8.1. FVom 11.1.2.2 and 11.1.2.7 it is straightforward to see that, for ev¬ 
ery subset A of an affine space X, there exists a smallest convex set containing 
A. This smallest convex set is called the convex hull of A, and denoted by 
£(A). 



Figure 11.1.8 


11.1.8.2. We shall see in 12.1.9 (and using 11 . 6 . 8 ) that the compact polyhe- 
dra of X are exactly the convex hulls of finite subsets of X. 

11.1.8.3. There are at least two importaut ways of finding £(A). One (taking 
the intersection of all half-spaces containing A) will be studied in 11.5.5; the 
other employs barycenters; 

11.1.8.4. Proposition. For any subset A of X, the convex hull £(A) is the 
set of barycenters of families of points of A with positive masses: 

t (A) = < ^ \iXi I X, e A, Aj > 0, Yhi ^ arbitrary 

'• iei 

(the notation A, assumes that Aj = 0 for all but a finite number of indices). 

Proof. TVivial, given definition 11 . 1.1 and using an induction based on 3.4.9. 

□ 
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11.1.8.5. It is natural to try to refine 11.1.8.4 in two directions; first, can we 
limit ourselves to families with a finite number of elements, and, if the answer 
is yes, can we find an upper bound for that number? Secondly, is it necessary 
to take the barycenters of all points of j 4? If not, of which ones? The answer 
to the second question is afforded by the theorem of Milman-Krein, cf. 11.6.8 
(see also 11.2.9). As to the first: any of the figures above will convince the 
reader that it is necessary to take families / with = d + 1. This is also 
sufficient: 

11.1.8.6. Theorem (Caratheodory). If X is an affine space of dimension 
d, the convex hull of an arbitrary subset A of X is given by 


S(A) = 



e A. A. > 0, A. = 1 


}• 


Proof. Let X = Yl!i=i AiSCii with k > d+1. Take an arbitrary vectorialization 
of X (cf. 2.1.9). Since dimX = d and k > d + 1, there exist a,- (t = 1, ■ ■ ■, k) 
such that X^*L| ctiXi = 0, a,- = 0 and not all the are zero. Set 

0 = { r e R I ra,- + A,- > 0 for all t = 1,..., }. 

This is a closed subset of R, non-empty since 0 £ 0, and distinct from R 
since the at don’t all vanish. Let r be a boundary point of 0 and j an index 
such that TOtj + Ay = 0. Then 

k k k 

Atit + r'Y^OLiXi = ^(A, -I- ra,)i,- = ’"“»)*«> 

t = l t=l f = l i^3 


and we have managed to express x as the barycenter (with positive masses) 
of fc — 1 vectors, since X^,yy(A,- -I- ra,) = 1 by construction. □ 

11.1.8.7. Corollary. If A is compact, so is C(A). 

Proof. The set 

If = I (A„ ..., A,,+ ,) e R-'^' I A, > 0 for aU t, E' = ,‘ A. = 1 J 

is compact, and 11.1.8.6 shows that S(A) is the image of K x C R*"^^ x 
under the continuous map 


■f+i 

(AI,..., A,i_^ 1}s?!,..., Xd.^-i )' ^ ^ h{X{. D 

f=i 

See 11.9.3 for a refinement of 11.1.8.6; see also 11.2.2. For a very nice appli¬ 
cation of 11.1.8.6, due to Hilbert, see [EL, especially section 5|. 

11.1.8.8. If A is bounded, diam(£'(A)) = diam(A); in particular, £{A) is 
stHl bounded. 

Proof. This is a consequence of 11.8.7.6. □ 
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11.2. The dimension of a convex set 

We recall that finite-dimensionai affine spaces have a canonical topology 
(cf. 2.7.1.4). 

11.2.1. Proposition. If S is convex, so is its closure S. 

Proof. This can be seen in two ways. One is to use the fact that the map 
(i,!/) H-» Ai + (1 — A)?/ is continuous, for all A. Another is to endow the affine 
space 5 where X sits with a Euclidean structure; then 5 = H, >0 
we conclude with 11.1.3.3. 

11.2.2. On the other hand, just the fact that A is closed does not imply that 
C(A) is also closed: see figure 11.2.2, where A is made up of the interior of 
one branch plus the center of a hyperbola. This justifies the next definition 
(the proposition is a trivial consequence of 11.2.1): 


A 


11.2.3. Definition and proposition. The closed convex hull of a subset 

A is the intersection of all closed convex sets containing A. The closed convex 
hull is identical with C(A). □ 

11.2.4. Lemma. Let S be convex, x & § and y € S points. Then \x,y\, the 
open segment with endpoints x and y, lies in §. 

Proof. Take z G \x,y\ and let U be an open set of S containing x (figure 
11.2.4). Choose a point y' in the segment \y,z\ (possible since y G S). The 
image of U under the homothety of center y' and taking z to a is open and 
lies entirely in S. □ 

o A 

11.2.5. Corollary. If S is convex, so is S, and one has S = S. □ 

It is, of course, false that § = S for arbitrary sets S (see figure 11.1.2.4, for 

example). As to the equality S = S, it may not hold even if S is convex: 
consider a liyperplane of X, for example. 

We now give a simple and nice criterion for the convex set 5 to have 
non-empty interior: 
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Figure 11.2.4 


11.2.6. Definition. The dimension of a non-empty convex set S, de¬ 
noted by dimS, is the dimension of the affine subspace (S) spanned by S 
(see 2.4.2.5); dim S = dim(5}. 


11.2.7. Proposition. If S is a non-empty convex set, dim5 = dim X if 
and only if S has non-empty interior. 

0 

Proof. If S 0 we have dim 5 = dim AT by 3.5.2, for example. Conversely, 
let (zt),'=i.<i+i be a family of affine independent points in 5; then 


d+1 


□ 


11.2.8. We can thus talk about the relative interior of a convex set 5, the 
interior of S in the affine subspace (S). 


11.2.9. Proposition. If S is a compact convex set, S = f (FVS). 

Proof. Take z € 5 and let D be an arbitrary line through z; then Z? n 5 
is a segment |u, v| containing z (cf. 11.1.2.3). Since .u, u £ FV5, we have 
zGf(FV5). □ 



11.3. Topology of convex sets 

Ill this section we deal with the classification of convex sets up to hom- 
eoniorphism (what this means will become clear as we go along). We also 
classify their frontiers. 
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11.3.1. Proposition. Let X be a d-dimensional affine space and A a 
d-dimensional convex subset of X. Then A is homeomorphic to R**. In par¬ 
ticular, all open, non-empty convex sets in a d-dimensional space are home¬ 
omorphic to R**. 

Proof. 

11.3.1.1. We start by giving X a Euclidean vector space structure, with 
origin O S A (this is possible by 11.2.7). Let 5 be the unit sphere in A. For 
y & S, let R(y) be the half-line with origin O and containing y, by 11.1.2.3 
and 11.1.2.7, R{y) n A is an interval, one of whose endpoints is O. If R(y) 
goes out of A, the other endpoint /(y) of R(t/) H A is the unique point of 
R{y) n FV(A), and we set S{y) = ||/(t/)||. Otherwise we set 6(y) = oo, and 
f(y) is not defined. 



11.3.1.2. The map 6 : S —* [0, oo) is continuous. 

By contradiction: assume first that 5(y) is finite, and let (*/„) be a sequence in 
S such that lim„_oo Vn = y- If it is not true that lim„_,oo ^(l/n) = ^(s/)i there 
exists a subsequence of (vn), still denoted by (*/„), such that 5(y,i) < 5(y) — r) 

O 

with r; > 0 or 6(y„) > 5(y) -i- with t} > 0. Since O G A, there exists an 
open ball 17(0, a), for some a > 0, contained in A, so A contains the shaded 
region in figure 11.3.1.2.1, and we get a contradiction if A( 5 /„) < 5(y) — r). If 
HUn) ^ + the contradiction comes from the fact that, by the convexity 

of A, the shaded region in figure 11.3.1.2.2 is in the interior of A. Finally, if 
^(y) = oo, we use the shaded part of figure 11.3.1.2.3. 

11.3.1.3. To conclude the proof, it is enough to recall that the interval 
[0, o[ C R, for any 0 < a < oo, is homeomorphic to [0, oo[, and that the 
homeomorphism can be made to vary continuously with o. More precisely, 
we define a homeomorphism h : A —* X a.s follows: ifx = 0orifzy^0 and 
5(z/||z||) = oo, put h(x) = x; otherwise, put 


. , , ^ ^(3:/||a:||) ' ||z|| ^ 

6(x/||x||)-||x|| ||x||- 


□ 
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Figure 11.3.1.2.1 




11.3.2. Note. From this we conclude that the frontier of a convex set has 
measure zero (cf. 2.7.4.3): on every R(y), the intersection R(y) n Fr(j4) is 
a single point, so it has measure zero; an application of Fubini and 11.3.1.2 
proves the result. See another demonstration in 12.9.2.4. 

11.3.3. Bearing example 11.1.2.5 in mind, one sees it is useless to try to 
classify arbitrary convex sets; this explains the restrictions in the statements 
below. From the proof of 11.3.1 we get; 

11.3.4. Corollary. If A is a bounded convex set such that 

dim A = dim X = d, 

the frontier FrA is homeomorphic to the sphere If A is also compact, 

A is homeomorphic to the closed ball of dimension d. In particular, if d = 2, 
FrA is a simple closed curve. 

Proof. The homeomorphism between FrA and is the inverse of the 

map y i—» f(y), well-defined because d(y) < oo for all y €: S, and 5 is always 
homeomorphic to □ 
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11.3.5. Now assume that A 15 a convex set of arbitrary dimension, not 
necessarily the same as that of the ambient space X. Applying 11.3.1 or 
11.3.4 to the affine subspace spanned by A, we complete the classification of 
all open convex sets and all compact convex sets; they are homeomorphic to 
R'* and the closed unit ball of R‘* , respectively, where d' is the dimension 
of A. 

11.3.6. Remarks on star-shaped sets. 

11.3.6.0. It is tempting to try to make the proof of 11.3.1 work for star¬ 
shaped sets (cf. 11.1.2.4): if E C AT is star-shaped at x £ j&, one can give X 
a Euclidean vector space structure and consider the unit sphere 5 centered 
at X and the half-lines R{y), for y G S. Here, however, the endpoint f(x) 
of the segment R(y) O £■ is not the unique point of R{y) O PrE'; consider 
the cat’s whiskers in 11.2.2.4. The cocoon of the star-shaped set E is the 
set { f(y) I y £ Rt^(y) < oo }; this set differs from Fr E if E is not convex, 
as we have just seen. Besides (figure 11.2.2.4 or 11.3.6.0.1) the function 6 
is no longer necessarily continuous, even if the cocoon is compact (in figure 
11.3.6.0.3 the cocoon consists of two points). 



X 

Figure 11.3.6,0,3 


On the other hand, in all the drawings above the star-shaped sets do 
seem to be homeomorphic to R‘*. This is not a coincidence; 

11.3.6.1. Theorem. Every open star-shaped set in X is homeomorphic 
to X. 



Figure 11.3.6.1.1 
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Proof. Vectorialize X at the center O = x oi the star-shaped set A. M X = A 
there is nothing to prove. If Fr A ^ 0 define the function 


^ : A 3 a: I—* ^(z) = d{x, Fr A) € . 

It is obvious that (f> is continuous. Define a map F ■. A —* X hy setting 

O fll*ll dt \ X 

-T- , 7 ,7 T-^ ■ 77^- 

„ ^(t(x/||x||))/ ||x|| 

The map F is the desired honieoniorphism. It is enough to show that F is 
continuous, bijective and proper (that is, the inverse image of any compact 
set is compact). 

Continuity: for x ^ O this follows from 11.3.6.2 and the continuity of 
4>. At O, recall that, since A is open, there exist e > 0 and k > 0 such that 
4>(x) > k for all x € U[0,(). 

Next we show that, for all y € S, 


11.3.6.3 


/, 


dt 


Hty) 


= d-oo, 


where the notation is the same as in the proof of 11.3.1. We distinguish two 
cases: if 6(y) = d-oo, let o be an arbitrary point of FVA, Then 4>{ty) = 
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d(ty, FV v4) < d(ty, a) < t + ||a||, whence 

r°° _di_ ^ r°° dt 

in Hty) ~ Jo < + ll‘ 
If 5(!/) = fc < + 00 , we have < d(ty,ky) 


t, and again 


/■'' dt ^ r* dt __ 
Jo 4>[^y) Jo ^ ~ i 


To show that F is proper, it is enough to see that the inverse image of a 

bounded set is bounded, since F is continuous and X is a finite-dimensional 

vector space. But this follows from the lower bounds given above for <l>(ty): 

if 5(!/) = +00 because the bound is uniform in y, and if 6 (!/) < -l-oo because 

f’ dt ... 

/ , . can be bounded from below by a uniformly continuous function of 

Jo v\iy) 

y (constructed from S(y)). 

11.3.6.4. Remark. In fact A is even diifeomorphic to X. See also exercise 
11.9.6. 


11.3.6.5. Corollary. Let X be a d-dimensional Euclidean affine space, S 
the unit sphere centered at O G X and C a closed convex cone with vertex O 
(cf. 11.1.6). Assume there exists y G S nC with —y ^ S nC. Then S \ C is 
homeomorphic 

Proof. Set M = S n C and y' =■ —y. For any half-great circle 7 of 5 with 
endpoints y and j/ we have y € -tOM and j/' ^ 7 O M\ also, 7 O A/ is convex 
because C is convex. Thus 7 fi (5 \ M) is a half-open arc |«/', '| of 7 . In 
particular, denoting by a the stereographic projection (cf. 18.1.4.3) with pole 
y from 5 onto the hyperplane H tangent to S at y', we conclude that tT(S\M) 
is a subset of H star-shaped at y', and open since C is closed. 11.3.6.5 now 
follows from 11.3.6.1 because of 18.1.5. □ 
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11.3.7. In 11.3.5 we have classified open and compact convex sets. We now 
must look at unbounded convex sets. Figures 11.2.1.1, 11.2.1.3, 11.2.1.6 and 
11.2.1.5 show frontiers homeomorphic to R X {—1, + 1}, R x 5^, R and 5*, 
respectively. This phenomenon is general: 

11.3.8. Proposition. Let A be a convex subset of X such that dimj4 = 
dimX = d and Frj4 ^ 0. Then FVj4 is homeomorphic to R' 

R" (0 < r < d - 1). 

Proof. Same notation as in 11.3.1.1; if j 4 is bounded, we are done with 11.3.4. 
Otherwise, set Af={!/G5|5(t/) = oo}. From the convexity of A we know 
that C = Uj,eA/ f^(y) ^ 3, closed convex cone. 




Figure 11.3.8.1 


11.3.8.1. First case: M O ( — M) = 0, that is, y & M implies —y ^ M. 
The set M does not contain antipodal points, or, which is the same, A does 
not contain entire lines through O. The hypotheses of corollary 11.3.6.5 are 
satisfied and Pr j 4 is homeomorphic to 5 \ M, hence to R'*“^. 
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11.3.8.2. Second case: there exists y G M such that —y G M, that is, 
A Zi D, where D is the affine line through O and y. Let C be an affine 
subspace of maximal dimension contained in A. Its dimension r must be less 
than d, otherwise A = X and Pr j 4 = 0. Let IV be a subspace complementary 
to V; IV is an affine subspace of dimension d — r and IV O = B is a convex 
set in IV. For x G IV, take t/ G V and make y go'to infinity in V in all 
possible directions; since [z, y\(l A and V C .4, an limit argument shows that 
A contains V^, the affine subspace parallel to V through x. Thus A Zi V X B, 
and the same argument shows that A <Z V x B. Since PrV = 0, we have 

FrA = Prl= Fr(V x B) = V x FrB = V x FrB. 

The proposition now follows from 11.3.4 or 11.3.8.1, since the maximality of 
V implies that no antipodes can occur in B. 



11.3.9. Note. The technique used in the demonstration of 11.3.8.2 is quite 
general, and has recently been utilized in the context of a certain type of 
Riemannian manifolds ([C-G]). See also |B-B-M]. 

11.3.10. Notes 

11.3.10.1. A convex curve in a Euclidean plane X is, by definition, the 
boundary of a two-dimensional convex subset of X. Proposition 11.3.8 shows 
that a convex curve is homeomorphic either to the circle 5^ or to the line R 
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(compare with the classification of one-dimensional differentiable manifolds, 
in [B-G, 3.4], for example). The reader can also study, from the present 
viewpoint, the proof of theorem 9.6.2 in [B—Gj; see 11.5.4 below. 

11.3.10.2. A convex surface in a Euclidean three-dimensional space X is, 
by definition, a connected set that is the boundary of a convex subset of X. 
Such a surface is, by 11,3.8, homeomorphic to R', 5^ X R (the cylinder) or 
5^. Convex surfaces have been the object of refined study: see, for example, 
[PVlj, [BUlj, [AWlj. For the differentiable case, see [DE4, volume IV, p. 360, 
problem 3|. 

11.3.10.3. Again from 11.3.8 it is easy to derive a topological classification 
of closed convex sets. 

11.4. Convex sets and hyperplanes. Separation 
theorems 

This whole section is based on the following result; 

11.4.1. Theorem (HaHN-BaNACH). Let X he an affine space, A an 
open, non-empty convex subset of X and L an affine subspace of X that does 
not intersect A. Then there exists a hyperplane of X containing L and not 
intersecting A, 



One cannot eliminate the condition that A b open: for instance, take A 
to be an open half-plane together with a segment in its frontier, and for L a 
point ill the frontier. See also exercise 11.9.9. 

A theorem like 11.4.1 is non-trivial; for example, if A = U(a,r) and 
L = {z} is contained in 5(o,r), the hyperplane obtained is unique, namely, 
the hyperplane tangent to 5(o,r) at x (cf. 10.7.4). 
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Proof. 

11.4.1.1. The first step is to reduce the problem to the two-dimensional case. 
Let M D L be a subspace of X of maximal dimension satisfying MnA = 0; we 
want to show that Af is a hyperplane. We first vectorialize X at some point 
O G L, and let p ; Af —» X/M be the canonical projection onto the quotient 
XjM. Then p(A) is open and convex in XjM (11.1.2.6), and O ^ p(A). Let 
Z C XjM be a vector subspace of dimension two intersecting p(A)-, such a 
plane exists because dim(Ar/Af) > 2, otherwise there is nothing to prove. The 
intersection p(j4) O Z is a non-empty, open convex set B. 




Figure 11.4.1.2.2 



Figure 11.4.1.2.3 


11.4.1.2. We now have to show that it is impossible for every line in Z going 
through O to intersect B (where B is an open convex set in Z not containing 
O). Heuristically, if we give Z an arbitrary Euclidean structure, the half-lines 
through O that intersect B determine on the unit circle 5^ of Af a subset 
that cannot contain antipodal points, for a line joining two antipodal points 
would force O G B. Since B is convex, thb subset of 5^ is also connected, 
so it must be an arc of length less than n, and there is a line that does not 
intersect it. We now have to make this argument rigorous. 
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Figure 11.4.1.2.4 Figure 11.4.1.2.5 

We introduce the cone C = Ua>(> cone is open, convex (an easy 

exercise left to the reader), and does not contain O. (In figure 11.4.1.2.2, 
where B is an open disc whose frontier contains O, C is a half-plane.) There 
exists at least one frontier point a: of C in Z \ O, otherwise C\0 would be a 
whole connected component of Z\0 (this can occur for R\0, but dimZ = 2 
forces Z \ O connected, cf 8.3.8). Thus x ^ C because C is open; but also 

—X ^ C, otherwise —x G ^ and x ^ C would imply O G by 11.2.4. But 
now the line D through x and O is entirely outside C, hence outside B, and 
we’ve found our line. □ 

11.4.2. Hahn-Banach’s theorem has fundamental applications in functional 
analysis, based on the correspondence between hyperplanes and linear forms: 
see, for example, [LG2, 186], [M-T, 23], 1BI3, 65]. In this text, however, we 
deal only with the “geometric” applications of the Hahn-Banach theorem. 

11.4.3. Definition . Let X be an affine space, A and B subsets of X and H 
a hyperplane. We say that H separates A and B if A and B lie in different 
half-spaces determined by H . In addition, if A and B do not intersect H, 
they are strictly separated by H . 

11.4.4. Corollary. Consider, in an affine space X, two non-empty con¬ 
vex sets A and B, with A open and A Ci B = 0. There exists a hyperplane 
separating A and B. 

Proof. Vectorialize X and define C = — B as in 11.1.3.1; C is a convex 

set, open (being the union of open sets) and 0 ^ C because j 4 O B = 0. Now 
apply 11.4.1 to C and the affine subspace (0). □ 

Since H is the frontier of each of the subspaces determined by it, we have: 

11.4.5. Corollary. If A and B are disjoint, open, non-empty convex sets, 

there exists a hyperplane strictly separating A and B. □ 

That would be false if the two sets were closed (figure 11.4.5). On the other 
hand, if one is closed and the other compact, we have: 

11.4.6. Corollary. If A,B are disjoint convex sets, A is closed and 
non-empty and B is compact, there exists a hyperplane strictly separating A 
and B. 
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Figure 11.4.5 


Proof. FVom point-set topology, there exists an open ball U(a,i), with e > 0, 
such that (a -t- U(a,c)) D [B + U{a,e)) = 0; but these two sets satisfy the 
hypotheses of 11.4.5. □ 

In the case of two general closed convex sets, all we can say is this; 

11.4.7. Corollary. If A and C are closed convex sets, there is a hyperplane 
separating A and C. 

Proof. Consider a Euclidean structure on the affine space; take a e A and 
consider the closed ball B(a,n), for n € N. By assumption, the convex sets 
A n B(a,n) and C satisfy the hypotheses of 11.4.6 for any n; let Hn be a 
hyperplane separating those two sets. It is enough now to show that one can 
find a subsequence of the Hn converging toward a hyperplane. We first find a 
subsequence such that the orthogonal directions to the hyperplanes converge; 
this is possible because the projective space of lines going through a is compact 
(4.3.3). From this sequence we can now extract a convergent subsequence by 
considering a convergent subsequence of the sequence of points Hn H [a, c], 
where c 6 C is an arbitrary point (remember that |o, c] is compact). 

We can now study the polarity operation introduced in 11.1.5; 

11.4.8. Proposition. Let X be a Euclidean vector space with origin 0. 

O 

i) If A is bounded, 0 G A*; t/ 0 G A, the set A* is bounded. 

ii) If A is a convex closed set containing 0, we have A** =■ A. 

Proof. Assume A C B(0,r) for r > 0. Then A* 3 (S(0,r)) = B(0, r“^). 

Similarly, if there exists r > 0 such that B(0, r) C A, we have A* C 

(B(0,r))* = B(0,r-i). 

Let A be a closed convex set containing 0; definition 11.1.5.1 gives the 
inclusion A C A**. Take o ^ A; by 11.4.6 there exists a hyperplane H that 
strictly separates A and o, so in particular 0 ^ H. Let h be the pole of H 
(cf. 10.7.11), that is, B = { z & X \ (z | h) = l}; then (a | h) > 1 and 

(z I h) < 1 for every z G A, so o ^ A**, and A** C A. □ 
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11.5. Supporting hyperplanes; applications 


An important case of separation is when A is convex and B is a point 
outside A. We’re led to the following definition: 

11.5.1. Definition. Let A be an arbitrary subset of an affine space X. A 
supporting byperplane for A is any hyperplane H containing a point x € A 
and separating {z} and A. We say that H is a supporting hyperphne for A 
at X. 
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By the above and the remark preceding 11.4.5, a supporting hyperplane 

O 

H for a convex set A satisfies H H A = 0. On the other hand, if If is a 
supporting hyperplane at x, we have a: G Fr A. 

The figures above show that supporting hyperplanes may not exist at 
all points of the frontier, that they may not be unique at any given point, 
and that they may be supporting at more than one point. This will be made 
precise in 11.6. 

11.5.2. Proposition. Let A be a dosed convex set. Then A has a support¬ 
ing hyperplane at any point of its frontier. 

Proof. A point x iu the frontier satisfies {i} n A = 0; just apply theorem 
11.4.1 with L = {z}. □ 

Using the polarity transformation of 11.1.5.1, one can show that there is a 
duality between points of Pr A and supporting hyperplanes for A: 

11.5.3. Proposition. Let X be a Euclidean vector space with origin 0, 
and A a closed convex subset of X containing 0 in its interior. The polar 
hyperplanes of points of Fr A make up the set of supporting hyperplanes of 
A*. Moreover, if x € Fr A has polar hyperplane H, the contact points of H 
are the poles of the supporting hyperplanes for A at x. 


X 



Figure 11.5.3 
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Proof. Take z G FV A. By definition 11.1.5.1, A* is contained in the half-space 
determined by H, the polar hyperplane of x (cf. 10.7.11). Similarly, if T is a 
supporting hyperplane for A at z, with pole p, we have p €: H (cf. 10.7.11) 
and p G A* because A** = A and A lies in the half-space determined by H 
and containing 0. Thus H is indeed a supporting hyperplane for A* at p. 
Since A** = A, one obtains in this way all the supporting hyperplanes for A* 
(cf. 11.4.8). □ 

We can now fulfil the promise made in 11.1.7.4: 

11.5.4. Proposition. Let A be a dosed set with non-empty interior. If A 
has a supporting hyperplane at every point of its frontier, A is convex. 

0 

Proof. We argue by contradiction. Take z G A, and assume there exist 

0 

points p, z G A and t G [y, z] such that t ^ A. Since t ^ A and z G A, 
the segment [z, t] intersects PrA in at least one point u G ]z, i[. Let H be 
a supporting hyperplane for A at u; its intersection with the affine plane 
spanned by z, y, z gives a line containing u. Now u is in the interior of the 
triangle {z, p, z), so every line through u strictly separates either z and p or 
z and z. Contradiction. □ 



Figure 11.5.4.2 


The non-empty interior condition is essential; see figure 11.5.4.2. The reader 
should compare the demonstration above with the one in [B-G, 9.6). 

We can also state a converse; 

11.5.5. Proposition: Let A be a dosed convex set. Then A is the inter¬ 
section of the closed half-spaces containing A. 
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Proof. Call that intersection C'{A). Then £'(A) is a closed convex set (cf. 
11.1.2.7) containing A. Assume there is x G ^'(A) \ A; a contradiction is 
obtained by applying 11.4.6 to A and S = {z}. □ 

11.5.6. Remarks. There exists a more elementary way, without using 
Hahn-Banach, to find supporting hyperplanes for A (without specifying the 
contact point): 

11.5.6.1. Let A be a compact subset of X and V a hyperplane direction of 
X (cf. 2.4.1). There exists at least one supporting hyperplane for A parallel 
to V. 

Proof. Let p : X —* X/V be the projection of X onto the quotient (cf. 2.2.4); 
p(A) is a compact subset of the affine line X/V. U a, ^ are its extremities, 
which possibly coincide, p“*(a) and p~^(y9) are hyperplanes parallel to V, 
and are supporting at every point of their intersection with A. □ 

11.5.6.2. This also shows that there exist at least two such hyperplanes if 
0 

A ^ 0. On the other hand, if A is closed but not compact, such hyperplanes 
may not exist; figure 11.5.6.2. 



Figure 11.5.6.1 Figure 11.5.6.2 


11.5.6.3. If X is also Euclidean, X/V has a natural Euclidean structure, and 
the length (that is, the diameter) of the compact set p(A) is called the width 
of A in the direction £ = and denoted by widj A. Convex sets with same 
width ill all directions have been studied by many people, but there are stUl 
many open problems on the subject, even in dinieiision two. In 12.10.5 we 
shall calculate the length of a curve of constant width; the reader can find 
drawings and references there. 
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11.5.6.4. A notion related to supporting hyperplanes is that of supporting 
function: see 11.8.12.3. 

11.5.7. The notion of convexity allows one to state results about the smallest 
ball containing a given compact set in a Euclidean space: 

11.5.8. Theorem (Jung). Let A be a compact subset of a d-dimensioned 
affine space X. There is a unique ball B(x, r) of minimal radius containing 
A. Furthermore, we have x€ f(An5(a:, r)) and 

and this is the sharpest possible such inequality. 




Proof. For every t > 0, set Ft = { i G | B(y, t) D A }. Since A is bounded, 
the set r = { < G R I Ff 7^ 0 } is non-empty; observe also that t < t' implies 
F( C Yf and that the Y, are compact. Since a decreasing sequence of compact 
sets has non-empty intersection (see 0.4), we have F^ = nt>r ^ 

If we had x, y €:Yr with x ^ y, we would conclude, from figure 11.5.8.2: 

< r and r^ - ^ G T, 
whence F^ = {z}, a contradiction. 

Now vectorialize X at x. Let u € S[x, 1) and e > 0 be arbitrary; by the 
definition of x and r, there exists a G A such that 

d(x, a) < r < d[a, eu). 

Since A is compact, we can take the limit as e —♦ 0 to obtain, for every 
u G 5(0,1), a point a G A such that (a | w) <0 and d(x, a) = r. This shows 
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Figure 11.5.8.4 


that X € f (a n 5(z,r)), for otherwise there would be a hyperplane strictly 
separating x and An5(z, r) (by 11.4.6), contradicting the result above applied 
to the vectors u and — u: 

{u} U {—u} = ■S'(a:, 1) n H'^. 

To show the inequality in the statement, we use Caratheodory’s theorem 
(11.1.8.6) to write x — Yli=i will* ■^t ^ Oj — 1 and each a; in 

5(z,r)nA. In particular, d(a i,aj) < 6 = diam A for every i ^ j. For a fixed 
i, we can write 1 — ~ whence 

(1 - A.)5^ > '^Xj<P[ai,aj) = ^ Ajd^(a., ay), 

3 ^ 


3 
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since d(ai,ai) = 0. But 

(P(ai, a,) = ||ai|p + ||aj||^ - 2(a,- | ay) > 2r^ - 2 ( 0 ; | ay), 

whence 

(1 - Xi)6^ > Ay^ 2 r 2 ~ 2 ^ 0 ; j ^ Ayay^= 2r^ - 2 (a. | x) = 2r^ 

(remember we vectorialized X at 2 , so a: = 0!). Adding over i we get 
^(1 - A;)i5^ = dS^ > 2(d+ l)r^ 

t 

as desired. Ekiuality can only obtain if d(a;, ay) = 6 for every i ^ j, which is 
the case for the regular simplex in the d-dimensional hyperplane A; = 1 : 

Sj = |(A,,...,Arf+i) G I A. > Ofor all t and ^A; = 1 

^ i 

(cf. 12.1.2.5). 

11.5.9. Notes. 

11.5.9.1. In the bargain, theorem 11.5.8 gives the value of the radius of the 
sphere circumscribed around the regular simplex Sj (cf. 9.7.3.7). See also 
11.9.18. 

11.5.9.2. Elquality in 11.5.8 can be achieved for sets A other than regular 
simplices, for example, the Reuleaux triangle (figure 12.10.5.1). 

11.5.9.3. Observe that the point x obtained, although canonicaUy associated 
with A, differs from its centroid (cf. 2.7.5.6). 

11.5.9.4. Observe also that one may have #(5( z,r) n A) < d -t- 1 , that is, 
some of the points a, in the proof may coincide (figure 11.5.9). 




11.5.9.5. One can also study balls of maximal radius contained in a compact 
set A; they always exist, but they are not unique. There exists an equality 
for this maximal radius, involving the minimal width of A: see 11.9.12 or 
|EN, 112). 
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11.6. The frontier of a convex set 

We shall now examine the different kinds of frontier points a convex set 
can have. 

11.6.1. Definition. Let A be a closed convex set in a d-dimensional space 
X, and let x be a frontier point of A. We say that x has order a if the 
intersection of all supporting hyperplanes to A at x is an affine subspace of 
dimension a. If x has order 0 we say it is a vertex of A; on the other hand, 
if a = d — 1, that is, if A has only one supporting hyperplane at x, we say 
that A is smooth at x. 





In a circle (figure 11.6.1.1] all frontier points are smooth, and there are 
no vertices. A d-dimensional simplex has frontier points of every order up to 
d— 1 (figure 11.6.1.2); the same is true of any d-dimensional polytope (12.1.9). 
Figure 11.6.1.3 shows a convex set with an infinite number of vertices, but no 
convex set can have more than a countable number of them: 

11.6.2. Proposition. Any convex set has only a countable number of 
vertices. 
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Proof. We start by giving X a Euclidean structure. For any z G PrA, we 
define CN^, the norniai cone to A at x, as the union of half-lines originating 
at X, orthogonal to supporting hyperplanes to A at x, and lying on the far 
side of such hyperplanes with respect to A. In sym'bols, 

CNx = { y G I (xy I xz) < 0 for any a G A }. 

If A is smooth at x, the cone CN^ is just a half-line; on the other hand, saying 
that z is a vertex is equivalent to saying that CN^ has non-empty interior. If 
z, y G Pr A are distinct, the cones CNx and CNy cannot intersect, for if they 
intersected at z, say, the triangle {x,y,z} would have two angles > tt (by 
assumption, (zt2 | z3) < 0 for any w & A, hence for w = y € A; and similarly 
I < 0). But by a classical argument, involving a countable dense 
subset, one can only fit a countable number of disjoint sets with non-empty 
interior in Jf. □ 



Figure 11.6.2 


11.6.3. For a sharper theorem on vertices, see |VE, 136]. 

11.6.4. Definition. Let A be convex. A point z G Pr A is called exposed 
if there exists a supporting hyperplane H at x such that H r\ A = {z}. A 
point z G A ts called extrema] if the equality z = (y -f- z)/2 with y,z ^ A 
implies y = z. A convex set is called strictly convex if all its frontier points 
are exposed. 

11.6.5. Notes. 

11.6.5.1. A vertex is always extremal, but the converse is false (see figure 

11.6.5.1) . But we shall see in 12.1.9 that the converse is true for polyhedra. 

11.6.5.2. An exposed point b extremal, but the converse is false (figure 

11.6.5.2) . 
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11.6.5.3. The set of extremal points of a compact convex set is not necessarily 
closed (figure 11.6.5.3), unless d=2 (exercise 11.9.8). 

11.6.5.4. If x is extremal, z € Pr A. Saying that x b extremal is equivalent 
to saying that A \ x is still convex; see figures 11.1.2.7 and 11.1.2.8. 

11.6.5.5. One of the reasons for interest in extremal points is proposition 
11.8.10.9. 

11.6.6. Examples. 

11.6.6.1. The extremal points of a. compact interval are its endpoiuts. 

11.6.6.2. Extremal points (in infinite-dimensional spaces) play an important 
role in measure theory; see [DE4, vol. II, p. 151, problem 8]. 

11.6.6.3. Extremal points are also encountered in applied math; for example, 
the set of bistochastic matrices is convex, and its extremal points are the 
permutation matrices (see 11.9.7, [KE, chapter 21] and [R-V, chapter Vj). 

11.6.7. We can now answer the question posed in 11.1.8.5; 

11.6.8. Theorem (KREIN-MilMAN). A compact convex set is the convex 
hull of its extremal points. 

Proof. For every convex set A, denote by Extr A the set of extremal points 
of A. Definition 11.6.4 easily implies that 

Extr( A n H) = (Extr A) n If 
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Figure 11.6.8 


for every hyperplane H of A; this will be used to prove 11.6.8 by induction 
on the dimension d of X. The theorem is true for d = 1 by 11.6.6.1; assume 
it is true for d — 1. By 11.2.9, the assertion A = S(ExtrA) will follow from 
FV A C f (Extr A). Take a: G Pr A and let If be ^ supporting hyperplane to A 
at X (cf. 11.5.2). Since An if is a convex set in .ff we have, by the induction 
hypothesis, 

a: G.ffnA = f(Extr(An.ff)) = f((ExtrA)n^) 

= f (Extr A) n if C f (Extr A). □ 

11.6.9. For other results on frontier points, see [VE, 138-139). 

11.7. Helly’s theorem and applications 

This very geometric theorem has many spectacular applications. It b 
interesting to observe that it was not proven until 1921, and that Kras- 
nosel’skii’s was proved even more recently: 1946. 

11.7.1. Theorem (HELLY). Let X be a d-dimensional affine space and J 
a family of convex subsets of X with more than d + 1 elements. If J satisfies 
the following two conditions: 

i) the intersection of any d + 1 sets in J is non-empty, 

ii) T is finite or all the elements of J are compact, 
then the intersection of all the sets in J is non-empty. 

11.7.2. Remarks. The convexity condition is necessary: figure 11.7.2,1, 
where the elements of 7 are arcs of circles. The number d + 1 in condition 
(i) is also best possible; figures 11.7.2.2 cind 11.7.2.3, where the sets are faces 
and edges, respectively. Condition (ii) is necessary: take 

7 = {|n, oo[ I n G N) 

(figure 11.7.2.4). Finally, notice that, in some sense, theorem 11.7.1 is a 
constructive one. 

11.7.3. Proof of Belly’s theorem 

11.7.3.1. First case: 7 is finite and all its elements are compact. An 
elementary set-theoretical argument reduces the problem to the case ff7 = 
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C3 

Figures 11.7.2.3 Figure 11.7.2.4 


d + 2. We then apply induction on d = dim AT; the result is trivial if d = 0. 
Let J = {Cl,..., Cd+2}i 3.nd assume that nf= f = 9; by condition (i) we 
must have 

.4 = Ci n-- nCrf+i #0 and AnC ^^2 = 9. 

Let H he a. hyperplane strictly separating A and Ch^2 (cf. 11.4.6), and con¬ 
sider the d-l-1 subsets HnCi ,..., HnC,i^i of H. They satisfy (i) in dimension 
d — 1, because the intersection of any d of the sets Ci,..., C,j+i meets both 
A and Crf+2 (the latter by the assumptions on /), and so meets H by con¬ 
vexity. By the induction hypothesis we then have .ff O Ci n • • • O C.j-^i = 0, 
contradicting the choice of H. 

11.7.3.2. Second case: / is not finite but all its elements are compact. By 

11.7.3.1 we know that every finite subfamily of 7 has non-empty intersection; 
by a simple result from point-set topology the intersection of 7 itself is also 
non-empty. We recall the proof; suppose that for any x G Fi there exists 
TV G 7 such that x ^ TV- Choose an open neighborhood of x such that 
Ur n TV = 0. By compactness, we can cover Fi with finitely many open sets 
Ux,, ; Ux^, contradicting the assumption that Fi n n ■ • ■ O Fx^ ^ 0. 

11.7.3.3. Third case: The sets in 7 are not necessarily compact but 7 is 

finite. As in 11.7.3.1, we can reduce the problem to the case #7 = d+ 2. Let 
7 = (Cl,..., C 4 + 2 } be such that every subfamily with d+1 elements has non¬ 
empty intersection. We shall construct compact sets Tfi C Q (i = 1,..., d-|-2) 
such that the intersection of every subfamily with d + 1 elements is still non¬ 
empty; the result will then follow from 11.7.3.1. The construction of Ki is as 
follows; by assumption, there exists pi (» = 2,...,d + 2); we set 

^1 = £(P 2 j ■ • ■ jPd+ 2 )- Clearly Ki C Ci and the family (TCi, C2 ,... ,C,i4.2} 
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still satisfies (i), since Ki O H Ci 9 Pi for every i = 2,..., <i + 2. The sets 
Ki (t = 2,...,d + 2) are constructed similarly. □ 

11.7.4. Remarks. 

11.7.4.1. For a linear algebra proof of the third case, due to Radon, see 
11.9.11. 

11.7.4.2. There is a close link between Belly’s and Caratheodory’s theorems: 
see [EN, 39|. See also a third proof of Belly’s theorem in |VE, 72-73]. 

11.7.5. Corollary. Consider, in an affine plane X, a finite family J of 
disjoint parallel segments such that any three of them enjoy a common secant. 
Then the whole family 7 has a common secant. 



Figure 11.7.5 
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Proof. We first deal with the case where two segments lie on the same line: 
then all the segments must lie on that line, and we’re done. Otherwise, take 
coordinates in .Y so that the j^-axis lies in the direction of the lines. For 
5 G /, set 5' = { (ct,/3) G | line y — ax + P intersects 5 }. It is 
immediate to verify that S' is convex. But, by assumption, any three sets 
S' have a common point, so they all have a common point (a,/9), and we’ve 
found a line y — ax + intersecting all the sets in J. □ 

The next corollary can be summarized by saying that a convex compact set 
in a d-dimensional affine space is d-quasi-symmetric. Naturally, if d 1, 
every segment is symmetric with respect to its center. For other results along 
these lines, see, for example, exercise 12.12.21 or |VE, 190, proposition 12.5]. 
See especially the extraordinary result of Dvoretsky [DVj which says that, 
for appropriately chosen dimensions, a convex sets always has quasi-spherical 
sections. 


11.7.6. Corollary. For every compact convex set A in a d-dimensional 
affine space X, there exists at least one point z & A such that every chord 
[u, uj of A containing z satisfies (cf. 11.2.9 and 2.4.6): 



< d. 



Figure 11.7.6 


Observe first that d is the best possible bound, as attested by the example 
of a simplex, for which, as the reader should check, the only possible z is the 
centroid. 


Proof. For every x G A, introduce the set 


A. 


1 d 

dTT d+T 


A 


(cf. 11.1.3), which is simply the image of A under the homothety 

(cf. 2.3.3.8). The sets Ax, for x G A, satisfy the assumptions of Belly’s 
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theorem, in the compact case: if {a:i)i=i.,i+i C A, their center of mass 


belongs to all the sets Ax, , since A is convex and 


y = 


Xi + 


d 


d+i'' 




G A, 




(cf. 11.1.8.4). Now take z G l“> *^1 ^ chord through a; to 

say that a G Au means that z G /fu..i/(.i+il (l*^) *^1)) 



uv 


d 

d "I" 1 * 


whence 



vz 


Switching u and v completes the proof. □ 

Here’s a useful application of the next corollary in two dimensions: if, for any 
three paintings in a museum, there is a point from which all three can be 
seen, then there is a point from which aU the paintings in the museum can 
be seen at once (you may need binoculars, but at least you can sit down...) 
Of course this is the same as saying that the museum is star-shaped. 



11.7.7. Corollary (KRASNOSEL’SKII). Let A be a compact set of the d- 

dimensional affine space d. Assume that, for any subset {a5t}i=i d+i *= 

there exists y & X such that [a:i,y] C A for i — 1 . d' + 1. Then A is 

star-shaped. 

Proof. The basic idea is to introduce, for every x G , the largest star-shaped 
set V.x centered at x and contained in A, that is, 

Vx= [y & A\\x,y\c a). 




Figure 11.7.7 2 


The sets are compact, and by 11.1.8.7 so are their convex hulls £ (V'j). 
The hypothesis of the corollary is exactly that any (d + l)-eleinent family 
of sets Vx, whence also of sets f (V®), has non-empty intersection. Thus by 
Belly’s theorem (11.7.1) there is a point y€ Hae/t "'*** ®how that 

A is star-shaped at «/, the difficulty being, of course, that we do not know o 
priori that y is in the intersection of the (observe that we cannot apply 
Belly’s theorem to the V^, which are not necessarily convex). 



The proof is by contradiction. Let a: G ^4 be such that [z, j/J ^ A. 
Introduce a Euclidean structure on X. Let x' be the farthest point from 
X on [z,!/) such that [z, z'] C A, and let u be an arbitrary point in [z, j/] \ A. 
By continuity, there exists w g]z', u| such that d[w,x') < d(u,A). Now take 
s G [n, iw] and t G A such that d(x,t) = d(\u,w], A). We have s ^ u by 
construction. Applying 11.1.7.2 twice, once to s and A and once to t and 
[u, lu], we see that (sf | sj/) < 0 and that Vt (whence also £ (Vt)) lies in the 
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half-space separated from x by the hyperplane H containing t and orthogonal 
to st. But this implies y ^ D 

11.7.8. For other applications of Helly’s theorem, see [VE, part 6], [EN, 
chapter 2] and [D-G-K]. 


11.8. Convex functions 


11.8.1. We recall that a function / : / —» R, where I is an interval of R, is 
called convex if, for any A G [0,1| and any x, t/ G /, we have 

/(Ax + (1 - A)t/) < A/(x) + (1 - X)f{y). 

We can generalize this definition as follows: 

11.8.2. Definition. Let A CZ X be a convex set in an affine space X. A 
map / : A —♦ R is called convex if, for every A G |0, 1] and any x,y €: A, we 
have 

/(Ax + (1 - A)t/) < A/(x) + (1 - A)/(t/). 

Such a condition only makes sense if A is convex. An equivalent definition 
requires the concept of the epigraph of a function, that is, everything above 
and including its graph: 

11.8.3. Definition and proposition. Let the epigraph of a map f : 
F —» R, where Y is arbitrary, be the set 

Epigr(/) = { (x,t) G y X R I X G A and t > /(x) 

A necessary and sufficient condition for a map f : A —• R on a convex set A 
to be convex is that its epigraph be convex. 

Graph of / Epigraph of / 




An easy induction argument shows that, for a convex function /, 

11.8.4 

for any non-negative scalars A, such that 52,- A, = 1 and any points x, in A. 
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11.8.5. Definition. A function f : A —* "R, where A is convex, is called 
strictly convex if, for any A G jO, l[ and any x,y G A with x ^ y, we have 

f{Xx+(l-X)y)<Xf(x) + (l-X)f(y). 

11.8.6. Definition. A function f : A R is called concave if the function 
— f is convex, that is, if 

f{Xx + (1 - A)y) > A/(x) + (1 - X)f(y) 

for any A G [0,1] and any x,y € A. 

11.8.7. Examples. 

11.8.7.1. Figure 11.8.3 shows that a convex function is not necessarily con¬ 
tinuous. In fact, a convex function on a compact set does not even have to 
be bounded; just take A = fl(0,1) C and make / = 0 in the interior of 
A and / positive, but arbitrary, on the frontier. On the continuity of convex 
functions, see 11.8.10.4. 

11.8.7.2. Affine forms f : X R (cf. 2.4.8.3) are convex, but certainly not 
strictly convex, since equality always holds in 11.8.4, by 3.5.1. 

11.8.7.3. The function x •—* x^, defined on R, is strictly convex, but x * |x| 
is ouly convex. 

11.8.7.4. If /, ; j 4 —> R is convex and c, > 0 for all i, the linear combination 

Yli ^ifi convex. 

11.8.7.5. If /. : A — R is convex for aD t, the suprenium of the /, is 
convex. This is seen by using 11.8.3, since the epigraph of the suprenium is 
the intersection of the epigraphs of the 

11.8.7.6. If x,y,z,t are four points in a Euclidean affine space, the function 

[O, l] 3 A ' —► d{^Xx -f- (l “ A)y, Xz 4- (l — £ R 

is convex. In fact, 

d{Xx + (1 - A)j/, Az + (l — A)t) = ||Azz + (l - A) t/t ||, 
and the desired inequality follows from the proof of 9.2.2. 
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We now present two somewhat less trivial constructions for convex func' 
tions. These examples will be essential later: see 12.11.3 and 15.5.9. 


11.8.8. The theorem of Brunn-Minkowski 

11.8.8.1. Theorem (Brunn-Minkowski). Let A and B be compact sets 
in a d-dimensional affine space X, and let £. be a Lebesgue measure on X. 
The function 

(0,1] 9 A ^ L{XA + (1 - X)bY'‘^ G R 
is concave (cf. 11.1.3 for the definition of XA + (1 -A)S). 

Proof. Consider on Af a Euclidean structure with Lebesgue measure £, and 
an orthonormal frame for this Euclidean structure. Let 7 be the family of 
open parallelepipeds with edges paraUel to the axes in our frame. 

11.8.8.2. First case. The sets A and B are in 7. Denote by a,-, 6,- (t = 
1,..., d) the sides of A and B. The set XA-i-fiB is in 7 and has sides Xoj-hf^bi, 
for any X, fi > 0. We thus have 

£(/!)=+/iS)= 

i i i 


Set u, = 
we have 


a.- 


Ao, + fibi 


and Vi = 


bi 


Ao,- + fibi 


, so that Au,- + /it), = 1; by 11.8.11.6 


£(A^’ 







11.8.8.3. Second case. Assume that A = IJ^i A,- and B = 
where Ai, By G 7 for all i,] and A,- O A,- = 0, By n By = 0 for all t ^ t', 
y ^ The proof is by induction on m+n. Assume, for example, that m > 1; 
then there exists a hyperplane H, parallel to a coordinate hyperplane, and 
strictly separating at least two sets A,. We can write A = A"^ U A~, where 
A'*' and A~ are of the same type as A, but composed of only and m~ 
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little paraillelepipeds, where m"^ < m and m~ < m. By continuity, there 
exists a hyperplane K, parallel to H and splitting B into pieces B~ such 


that —r = - { . Further, B+ and B~ are of the same type as B, 

C\A-) jC(B“) 

with n"^ < n and n~ < n. We can now apply the induction hypothesis to 
+ pB+ and XA~ + /xB“, obtaining 


£(A^ + /iB) = £(A^+ + /iB+) + £(A.4- + /xB") 

> {XC{A+yl^ + At£(B+ )*/“')'' + {XC{A-fl^ + iiC{B-fiy 
= {XC{AYI^ + /i£(B)^/Y- 


11.8.8.4. Third case. Now let A and B be arbitrary compact sets. It is 

a standard result in Lebesgue measure theory that A and B can be approx¬ 
imated by sets A„ and B„ as in 11.8.8.3 and satisfying |£(.A) — £(.A„)| < e, 
|£(B) — £(B„)| < £. This leads to the desired conclusion. □ 

11.8.8.5. Corollary. For arbitrary compact sets A, B we have 

£(vl-l-B)‘/‘'>£(v4)*/‘'-l-£(B)*/‘'. □ 


11.8.8.6. Remark. One can show that the function in 11.8.8.1 is strictly 
concave if and only if A and B are not homothetic (with a positive ratio). 
The proof is more delicate, cf. [EN, 97] or [HR, 187). 

11.8.9. The function of Loewner-Behrend 

11.8.9.1. Recall the notation introduced in 8.2.5.2: Q{E) is the space of 
Euclidean structures on a d-dimensional real vector space E, and forms a 
convex open set in the vector space Fix on B a Lebesgue measure 

£. Given q E Q(E), consider the ellipsoid 9“*(1) (cf. 15.3.3.3), whose convex 
hull is the solid ellipsoid £(g) = 9~*([0, Ij). We wish to calculate the volume 
£(£(q)) of this ellipsoid (cf. 9.12.4). 



11.8.9.2. Let A (resp. A') be the matrix of q with respect to a basis B (resp. 
B') of B (cf. 13.1.3.6); then A' = ^SAS, where S is the matrix with respect to 
B of the map / E Isom(B) such that /(B) = B'. In particular, if B and B' are 
unitary bases of B with respect to £, that is, if their associated isomorphisms 
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R'^ —♦ E are as in 2.7.4. 2 , we have det5 = 1 and detA = detA'. Thus it 
makes sense to define the determinant of q with respect to £: 

11.8.9.3 det£ q = det A, 


where A is the matrix of 5 in any unitary basis B. We will show that 
11.8.9.4 i:(f ( 9 )) = /3(d)(det£ q)-^/^, 

where fi(d) is defined in 9.12.4.7. 

Proof. Consider a Euclidean structure on E with canonical measure £ (cf. 
9 . 12 ), and let / G Isom(E) be such that /(B(0, l)) = £(q), that is, such that 
/*9 ~ II ■ iPi norm of the Euclidean structure (cf. 13.1.3.9). In particular 
(13.1.3.10), since the matrix of || • p is /, the matrix of A satisfies / = ‘17 AU, 
where U is the matrix of / in an arbitrary orthonormal basis. We thus have 
det A (det/)^ = 1. By 2.7.4.3 we can write 

£{£(q)) = |det/|£(fl(0.l)) = |det/|^(d) = /3(d)(det£ 9 )-i/ 2 . □ 

The theorem of Loewner-Behrend will be demonstrated in 11.8.10.7 using 
the following proposition; 

11.8.9.5. Proposition. The function Q{E) B q £{£(q)) € R is strictly 
convex. 

Proof. Take q, q' G Q(E); by 13.5.5, there exists a basis, which we can assume 
to be £-unitary, diagonalizing q and q' simultaneously. Thus we can write 

i i 

For determinants calculated in this basis, we have detg = fli “•> detg' = 
f],- aji and, for A, A' >0 with A + A' = 1, 

det(A9 + A'g') = 


Applying 11.8.11.4, we can write 

(det(A 9 + A' 9 ')) = ]][(Aa,- + A'o')“*/^ < 


^{{uryrn'i 

< A(det9)-*/2 + A'(det9')-i/^ 


-1/2, 


Equality can only hold if it holds for eaclTpair (a;, oj), that is, if a,- = aj for 
every i. But this implies q = q' ■ 


11.8.10. Properties of convex functions. We have seen in 11 . 8 . 7.1 
that a convex function, even on a compact set, is not necessarily continuous 
or bounded; however, we will see that complications can only arise at the 
boundary and for upper bounds, as hinted in 11.8.7.1. 
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11.8.10.1. Proposition. A convex function f : A —* 'R is bounded above 
on every compact subset of the relative interior of A (cf. 11.2.8), and bounded 
below on every bounded subset of A. 

O O 

Proof. As usual, we reduce to the case A ^ 0. Let z G A be arbitrary, let H 
be a supporting hyperplane to Epigr(/) at (z,/(z)) (cf. 11.8.3 and 11.5.2), 
and take an affine form g : X R such that H = ^“^(0). We have f > g by 
construction; but an affine form is bounded over bounded sets. 

O 

Let K Q A he compact; for every x €: K, there exists a simplex Sx of 
X contained in A and such that z G lnt(f(5i)). By compactness, we can 
cover K with a finite number of such simplices; by 11.8.4, the value of f(x) 
at any point z G /f is bounded above by the values of / at the vertices of 
those simplices, and there are only finitely many such vertices. 




11.8.10.2. Lemma. Let / : [a, 6] —» R be convex, and take c Gjo,6|. 
i) The function [a, 6] \ c 9 t i—> | _ is increasing. The function f 

has derivatives to the right and to the left, denoted by fl{c) and fl.{c), 
and //(c) < //(c); in particular f is continuous at c. 
hi) If f is differentiable on ja, 6[, its derivative is increasing; if f is twice 
differentiable at c, we have f"[c) > 0. 



Figure 11.8,10.2 



11.8. Convex functions 


375 


Proof. Part (i) follows from the definition (see figure 11.8.10.2). For every 


t > c, the function t 

/(«) - m 


m - m, 


is increasing and bounded below by 


a — c 


so the limit 


exists; the same is true of //(c), and //(c) < //(c). As to /', if it exists, it 
must be increasing because /'(c) < — c, d G [o, fc[. 

The last assertion in (iii) is well-known. □ 

11.8.10.3. Notes. The last proof can be easily extended to show that 

{c G la,6[ j //(c) ^ //(c)} 


is countable. For finer properties of convex functions, and the infinite-dimen¬ 
sional case, see [R-V, chapter IV). 

Lemma 11.8.10.2 implies that a convex function / : A —» R, where A C X has 
arbitrary dimension, / has a directional derivative on any half-line starting 

, O 

at a point a G A. We apply this idea as follows; 

11.8.10.4. Proposition. Let / : A —» R be convex; then f is continuous at 
every point in the interior of A. 



Figure 11.8.10.4 


o o 

Proof. As always, we reduce to the case A ^ 0. Take a G A and e > 0 such 

O 

that C A (the ball is measured with respect to an arbitrary Euclidean 

structure). Using 11.8.10.1, define 

M = sup { /(z) I z G B[a, c) }. 

For arbitrary z G B(a,t], x ^ a, let u,v be the points where the line (a, z) 
intersects S(a, r). We apply 11.8.10.2 to the restriction of / to the affine line 

/(z) - /(g) /(u) - /(g) M - /(a) 

||z-a|| - ||u-a|| - ^ 
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and 

f{x] - f(a) f(v) - f(a] M - f(a] 

— ||z —a|| ~ ||w — a|| ~ * ’ 

whence 

\f(x)-f(a)\< ^~J^‘^^ \\x-a\\. □ 

11.8.10.5. Theorem. Let f : A R, be a convex function on an open 

convex set A. Then f is differentiable almost everywhere, and its derivative 
is continuous where defined. □ 

We will not have the time to demonstrate this important and natural theorem; 
the reader can consult [R-V, 116-117), or see 11.9.14. The matter of second 
and higher derivatives of convex functions is still a practically open research 
question (|R-V, 120]). Theorem 11.8.10.5 play an essential role in 12.10.11.1. 
The remainder of 11.8.10 is devoted to some extremum properties of convex 
functions. We start with the following 

11.8.10.6. Proposition. Let f : A be strictly convex. If f achieves its 
minimum, it does so at a unique point. 

Proof. If m = inf^ / and f(a) = m = f(b) we have 




m + m 


whence a = 6 by 11.8.5. □ 

We now apply this elementary proposition to the question we started studying 
in 11.8.9. 


11.8.10.7. Theorem (Loewner-Behrend). Let E be a finite-dimensional 
real vector space, endowed with a Lebesgue measure. If K is a compact subset 
of E with non-empty interior, there exists a unique least-volume solid ellipsoid 
containing K. 

Proof. By 11.8.10.5, 11.8.9.5 and the continuity of 9 i—► (9)), it is enough 

to show that we can find a minimum over an appropriately chosen compact 
convex subset of Q(E). Since K is bounded, there exists at least one solid 
ellipsoid f (go) containing K. The set we want is 

= { g 6 Q(E] I f (g) D if and det£ g > det£ go }, 


and we have to show that this is a compact convex subset of Q(E]. Convexity 
for the condition det£ g > det£ go follows from 11.8.9.5; as for the condition 
£ D K, v/e have 


f(g)= {xG Jr|g(a:) <1}, f (g') = { x G .If | g'(x) < 1 }, 

so if q,q' G A and A G [0, l] we indeed have 

(Ag + (1 - Ag'))(z) = Ag(x) + (1 - A)g'(z) < 1. 

The set A is closed in P2(E), being the intersection of inverse images of 
intervals under continuous maps, but we must watch out for the fact that 
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Q(E) is an open subset of P 2 (E). But the boundary of Q(E) is made up of 
degenerate quadratic forms, whose det^ is zero, so .4 is in fact closed. To see 
that it is bounded, observe first that £{q) D K implies £(q] O f (ff U 
where —K is the reflection of K through the origin, and that £ (ff U(— ff)) has 
non-empty interior because K does. Consider on E an arbitrary Euclidean 
structure and let £ > 0 be such that B(0, c) C £ U ( — K)); the eigenvalues 
Aj of g G Q(E], with respect to the Euclidean structure under consideration, 
must satisfy Ai < l/c for all z if we are to have £(q) D B(0, c); and this shows 
that A is bounded. □ 



We shall use 11.8.10.7 in 15.5.9, but we remark from now that it provides the 
promised third proof for 8.2.5 (a proof not involving integration theory, since 
volumes of ellipsoids are mere determinants!). 

11.8.10.8. Corollary. If G is a compact subgroup of GL(E), there exists 
q e Q(E) invariant under G. 

Proof. Let H be an arbitrary compact subset of E with non-empty interior, 
K = G(H) be the orbit of H under the action of G, and £(q) the least-volume 
ellipsoid containing K. The set f (g) is invariant under any g €: G, since, by 
construction, g(K) = K aud g{£(q)) D K. Finally, 

-c(?(f(9))) = -c(n9)) 

since detj = 1 for any g (otherwise G is not compact, as detgr" = (detgf)"). 
Since the least-volume ellipsoid is unique, we have sr(f(9)) = ^"(9)1 we 
have shown that the form q satisfies g*q = q for all gr G G. □ 

Now back to extremum values of convex functions. 

11.8.10.9. Proposition. If f : A JL is convex and continuous, f achieves 
its minimum at at least one extremal point of A, that is (cf. 11.6.8) 

/ = ®“PExtr(>») /• 

Proof. By 11.1.8.6 and 11.6.8, every a G j 4 is a finite barycenter of extremal 
points; thus, if M = suPExti(>i) /> we have /(a) < M for all a G j 4, by 11.8.4. 

□ 
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11.8.10.10. Note. Proposition 11.8.10.9 is of paramount importance in the 
practicail search for majcima, as in game theory and linear programming, for 
example ([KE, 86 ff.]): if the function whose maximum we seek is convex, 
it suffices to know its values at points of Extr(j4). Such points are finite 
in number if is a polyhedron, for instance (cf. 12.1.9). For.details and 
examples, see [KE] and [R-V, chapter V). 

11.8.11. Convexity criteria. Examples 

11.8.11.1. Proposition. Lei I C R te an interval and f : I R a 
twice differentiable function. In order for f to be convex it is necessary and 
sufficient that f"(x) > 0 for all x £ I. For f to be strictly convex it is 
necessary and sufficient that f"{x) > 0 for all x E I. 

Proof. Set, for a,b E I, 

g(x) = fix) - f(a) - (fib) - /(a))^; 

We have to show that ff < 0 over [o, fc]. But g" = f", so ff' is increasing; since 
ff' vanishes at some a E [a,f)] (at the point where g is minimal, for example), 
ff' is non-positive to the left of ot and non-negative to the right of a. Thus 
ff decreases starting at ff(a) = 0, then it increases again till gib) = 0. This 
shows that ff < 0. □ 



a a b 

g 

o 

o 

g’ 


g” 

+ 


Figure 11.8.11 



11.8.11.2. Corollary. Let f : A —* R be of class C^, where A is an open 
subset of X. In order that f be convex it is necessary and sufficient that its 
second derivative 

fix) :XxX--R 

be a positive semidefinite quadratic form for every x, that is, f"ix)iy,y) > 0 
for every y E X. For f to be convex it is necessary and sufficient that fix) 
be positive definite, that is f"ix)iy,y) > 0 for every y E X\0. 

Proof. Given a,b E A, it is enough to apply 11.8.11.1 to the map t -+ /(o -1- 
tab), whose second derivative is equad to /"(o -1- tah)iah, ab). □ 

11.8.11.3. Example. The function — logr is strictly convex on since 
(—logr)" = l/x^, whence 

— log(Aa -1- A'a') < —A log o — A' log o' 
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for A, A' > 0, A + A' = 1. Since the log is increasing, we get 

11.8.11.4 Ao + \'a' > o*o'^' 
for any A, A' > 0 with A + A' = 1. More generally, 

11.8.11.5 XiOi > fj a-i' 

« « 

for any set of A; >0 with A,- = 1. In particular, we obtain the well-known 
inequality between geometric and arithmetic means; 


11 . 8 . 11.6 


“1 ^ n y > 


where equality holds if and only if all the a,- are equal, since — logz is strictly 
convex. For n = 2, we recover the elementary inequality o -1- 6 > 2>/o6. 

11.8.11.7. Example. For every fixed real number p > 1, the function 

R+ 3 X € R 

is strictly convex, since f"(x) = p(p— We thus have 

11 . 8 . 11.8 

V, ' 

for any > 0 and A^ > 0 with = 1. This leads to the Holder inequality 
_ _ i/p /_ \ i/« 

11.8.11.9 


<^A.a? 


(E.?) 


(p > 1, 1/p-t- l/q = 1, x« > 0, Pi > 0), of which 8.1.3 is a peuticular case 
(p = 2). Holder’s inequality, however, is much less elementary than 8.1.3; to 
deduce it from 11.8.11.8, one must find numbers Aj, a,- and k such that 

Affl,* = x,*pi, A,*^ = ^ ^ Af = 1. 

i 

The right numbers are 

^=(E »0 

11.8.11.10. Lastly, one deduces from 11.8.11.9 that, for every p> 1, 

R'' 9(xi,...,x<i)i- 

defines a norm on R‘'. This requires proving the Minkowski inequality: 

11.8.11.11 (5 i., + kI')‘"< 
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this is done by using 11.8.11.9 twice, as follows: 


E 

i 

E 

t 

E 

i 


k,ik.+!/,r' < (ENr)""(Ei*<+wip)"’'‘'''’ 

Is/iIII.- + Vi * < ^E I'’j (E Ip] 

+ t/il'’ < JZdzil + |?/i|)(|a:i + t/il'’”*) 


<(EK+!'a'’) ((Ei'.r) +(Ei»r) )■ 


11.8.11.12. Notes. Inequalities 11.8.11.9 and 11.8.11.11 are absolutely 
essential in analysis, as they allow the definition of L,,-spaces. 

For more convexity inequalities, see, for example, |DE3, 40 ff.], [R-V, 
chapter VI] and the references therein, especially the classical book [H-L-Pj 
and the modern text [B-Bj. 


11.8.12. Convex functions versus convex sets. We have already 
encountered a connection between convex functions and convex sets in 11.8.3. 
Two more general relationships are discussed below, but not in detail: the 
reader can consult [EN, 54|. 

11.8.12.1. Throughout 11.8.12 X denotes a Euclidean vector space. A gauge 
on X is any convex function f : X —*1R. satisfying 


/(Ax) = A/(x) 

for any x € X and any A > 0. Norms, for example, are gauges. 

11.8.12.2. If / is a gauge, the ball C{f) = { x € X | /(x) < 1 } is a convex 
set. Conversely, if C is a convex compact set containing 0 in its interior, the 
function given by 

f<;(x) = inf { A I A > 0, X e AC } 

is a gauge, but will only be a norm if C = —C, that is, if C is symmetric. 
The boundary of C is /,7^(l)i we say that f(j is the distance function on C. 

11.8.12.3. If C is bounded in X, the function h(j(x) = sup { (j/ j x) | j/ S C ) 
is also a gauge, called the supporting function of X. 

11.8.12.4. The relationship between he and fc is a consequence of 11.1.5 
and 11.4.8; if C is a convex compact set with 0 € we have 


/o’* — ha, ha‘ — Ze¬ 


ll.8.12.5. A gauge is determined by its restriction to the unit sphere 5 = 
5(0,1). In general, it is not possible to explicitly recover C from h = h,; 
(given on 5). U h is differentiable, one can say that C is the envelope of the 
hyperplanes /ft = { x S X | (x j t) = 1 } for t ranging over 5, or again that C 
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Figure 11.8.12 


is the transform of h ^(l) under reciprocal polarities with respect to 5. In 
the case of the plane, h corresponds to the Euler equations of the curve FrC 
(see 12.12.14 and 12.12.15). 

11.8.12.6. Supporting functions are a powerful tool in the geometric study 
of convex sets: see, for example, [EN, 54 and the whole of chapter V] and 
[BUl, chapter II, § 6]. See also 11.9.14. 


11.9. Exercises 


11.9.1. Show that Minkowski addition satisfies the relations 

(A + B) + C = A+(B + C), 

(A U fl) + C = (i4 + C) U (B + C), 

(A n fl) + C = (A + C) n (B + C), 

B„.a(A+ B) = B„.4A) + B„,a(B), 

Ha.X + ii{,A) C Ha.x(A) + Ha.,i(A). 

When does equality hold in the last relation? 

11.9.2. TVue or false; For every bounded set A, diam(f (A)) = diam A. 

11.9.3. Let X be a d-diiueiisional affine space and A a subset of X with 
at most d connected components. Show that every point of f(A) is the 
barycenter of d points in A. 

* 11.9.4. Hilbert geometry. Let A be a convex compact subset of X 

0 

whose interior is non-empty. Given two distinct points x, y in A, set d(x, y) — 
|log[x,!/, u, w]|, where u,v are the two points where the line {x,y) meets the 

O 0 

frontier of A (figure 11.9.4). Show that d : A x A —» R, defined by the 

O 

equation above and d(x, x) = 0 for all x g A, is a metric. Show that this 
metric is excellent (9.9.4.4). Study the relation between the strict triangle 
inequality and the nature of the boundary points of A. For a notion of area 
in this geometry, see [B-K, 167]. See also 6.8.14. 
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11.9.5. Let X be an affine space of dimension greater than 1, and f : X —* X 
a bijection. Show that if, for any convex set A, the image /(A) is convex, 
then / is affine. 

11.9.6. Consider in the sets 

fl = { (2:,0) I a: > 1 or a: <-1 }, A = \ B. 

The set A is star-shaped (figure 11.3.6.0.3). For the homeomorphism F ; 
A —» R^ constructed in the proof of 11.3.6.1, draw the inverse images of 
circles around the origin. 

11.9.7. Let E be the vector space of n x n real matrices, and K the subset 
of E consisting of bistochastic matrices, that is, those satisfying 

Otj ^ 0 for aU *i = 1 for all j, Ojy = 1 for all i. 

i j 

Show that B is a compact polyhedron in E and that the vertices are the 
permutation matrices, that is, those whose every row or column have exactly 
one non-zero element, equal to 1. 

* 11.9.8. Prove that the extremal points of a convex set in the plane form a 
closed set. 

11.9.9. Consider in R^ the closed convex cone C defined by the inequalities 
x>Q,y>0, z>Q and < xy. Show that the line D with equations z = 0, 
z = 1 does not intersect C, but that there exists no plane containing D and 
not intersecting C. 

11.9.10. KiRCHBERGER’S theorem. Let A and B be finite subsets of a 
d-diniensional space X. If, for every (d-f- 2)-element subset of X, it is possible 
to find a hyperplane that strictly separates A n K and B D F, there exists a 
liyperplaue that strictly separates A and B. 

11.9.11. Let Ai (i = 1,..., r, r > d -f- l) be convex subsets of R'* such that 
any r — 1 of them have non-empty intersection. Show that the intersection 
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of the Ai is non-empty, as follows; take ij £ -^i (* = !> • • •> *■)) 

scalars A,-, not all zero, such that Aj = 0 and A,Zi = 0. Finish off 

by splitting the Aj into two groups, one non-negative and the other negative. 


11.9.12. Maximal and minimal widths. Let C be a compact convex set. 
The maximai (resp. minima]) width of C, denoted by D(C) (resp. d(C)), is 
the supremum (resp. infimum) of the width of C in the direction f, as f ranges 
over the set of directions of lines (cf. 11.5.6.3). Show that D{C) — diam(C), 
the diameter of C. 

The interior radius of C, denoted by r(C), is the supremum of the radii 
of all spheres contained in C. Show that, for d the dimension of the ambient 
space, we have 

r(C) > Jl. for odd d, 

^ 2y/d 


See [EN, 112-114] if necessary. Are these inequalities sharp? 


11.9.13. Let A be an open convex set and / : A —♦ R a function satisfying 

J /H+ib) 

for every x,y € A. Show that A is convex. See |VE, 130] if necessary. 


11.9.14. Supporting functions. Find an explicit formula for the sup¬ 
porting function of a point, of a ball S(o, r), and of a cube centered at the 
origin. Show that if / (resp. g) is the supporting function of A (resp. B), the 
supporting function of the Minkowski sum AA -t- fiB is simply A/ -f- fig. 


11.9.15. Differentiability of convex functions. Let A be an open 
convex set and / : A —♦ R a convex function. Show that if the partial 
derivatives dfjdxi (with respect to some frame {cj} for the ambient space) 
exist at a point a £ A, the function / is differentiable at a. Deduce, using 
11.8.10.2, that / is differentiable almost everywhere (use Fubini’s theorem). 


11.9.16. Convex sets and the geometry of numbers. Let A = ZxZ 
be the unit lattice in R^, and C a compact convex subset of R satisfying 
—C = C. Show that if the area of C (cf. 9.12 or 12.12) satisfies 11(0) > 4, 
then C intersects A \ (0,0), that is, A contains points with integer coor¬ 
dinates not both zero (Minkowski). Generalize for dimension d. Deduce 
that if f(x, y) = ax^ -f- 2bxy -f- cy^ is a positive definite quadratic form with 
discriminant D ~ ac — b^, there exist integers x,y, not both zero, such 
that f(x,y) < AfnyfD. For elaborations on this result, the first in the so- 
called “geometry of numbers”, see the excellent work [CSj. See also [LE] and 
[DE3, 202-203]. 


11.9.17. Show that for every compact convex subset A of a Euclidean affine 
space X we have Isx(X) = IsExf.r(A)(-^)' 
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11.9.18. Is there a connection between Jung’s theorem (11.5.8) and Helly’s 
theorem (11.7.1)? 

11.9.19. KaKUTANI’S lemma. Let C\ and <72 be disjoint convex sets, and 
z ^ Cl U C2 a point. Denote by Fj the convex hull of {z} U C,- (i = 1,2). 
Show that at least one of the sets Fi n C2 and F2 n Ci is empty. 

* 11.9.20. Given a subset A of an affine space X, consider the set N(A) of 
points o such that A is star-shaped with respect to o (cf. 11.1.2.4). Show that 
N(A) is convex. Find N(A) for a. number of shapes of A. 

* 11.9.21. The theorem of Lucas. Let P be a polynomial with complex 
coefficients, and let P* be its derivative. Show that in the affine space C, 
all the roots of P* belong to the convex hull of the roots of P. When P has 
degree three and distinct roots a,b,c, show that there is an ellipse inscribed 
in the triangle {a, 6, c) whose foci are the roots u, v of P*. 

* 11.9.22. Find aU partitions of the plane into two convex sets. 

11.9.23. The theorem of Kritikos. Let C be the frontier of a compact 
convex subset K of a. Euclidean space E, and assume K has non-empty 
interior. Given m € A, denote by R(m) the radius of the smallest sphere 
with center m containing K, and by r(m) the radius of the largest sphere 
with center m contained in K. 

Show that there exists a unique point m € k such that the difference 
jR(m) — r(m) is minimal. Show that for this point the associated containing 
and contained spheres have at least two points in common which C. 

Deduce that for every compact convex differentiable hypersurface C of 
E (that is, the differentiable frontier of a compact convex set) there exists at 
least one point from which one can draw four normals to C. 

11.9.24. John’s Ellipsoid. Without using 11.8.10.7 show that every com¬ 
pact convex set in R'' contains an ellipsoid of maximal volume. Morevoer that 
ellipsoid is unique. Show that the considered convex set is always contained in 
the ellipsoid deduced from the preceeding one by a homothety of ratio d and 
of center equal to the center of this ellipsoid. Show that, when one restricts 
the bodies under consideration to ones having a fixed center of symmetry 
then this ratio can be reduced to Vd. Show that these two ratios are optimal. 

11.9.25. Rado’s Theorem. Let T be a set of with at least d-|-2 points. 
Show that there exists a decomposition T = Ti U T2 in two disjoint subsets 
such that £{Ti)n£{T 2 ) ^ 0. Draw examples. Show that, from any one of the 
three theorems of Caratheodory, Kelly and Rado, one can always deduce any 
one of the two others. 
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— morphism 2.3, 2.S.1, 3.2, 3.5 

— polynomial 3.3.5 

— quadratic form S.S.6, 15.2 

— quadric 15.1.1, 15.3, 20.1.2 

— space 2 .1.1 
-, complex 2.8.7 

-, complexification of 7.6.2 

-, dimension of 2.1.8 

-, orientation of 2.7.2 

-s, differential calculus in 2.7.7 

-s, isomorphism between 2.2.6 

-—s, product of 2.2.2 

— subspace 2.4-1, 5.3 

-s, intersection of 2.4.2.4 

Alexandroff 12.11.5.3 

— compactification 5.2.6, 18.1.3.2, 

20.2.4 

algebra 0.2 

algebraic curves 16.4.9 

— dual 0.2 

— equations, solution by radicals 

16.4.11.1 

— geometry 4.0.5, 4.2.6, 12.7.5.1, 

17.1.6 

— group 12.6.9 

— hypersurface 14.1.4.2 

— number field 13.1.4.5 
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— topology 4.0.5, 4.3.7, 4.3.9, 

12.7.5.3, 11.86, 18.8.7, 19.1.3.3 

-of orthogonal groups 8.10 

-of projective spaces 4.3, 4.8.2 

-of the sphere 18.2.1 

— varieties 14.1.6.2 
algebraically closed base field 6.4.2, 

6.5.1, 6.7.2, 6.7.4, 13.4.3, 13.5.3, 

14.1.5.3, 14.1.6.2, 16.2.10 
Alhambra 1.7.1 

alternating bilinear form 2.7.2.1, 
14.8.12.2 

— group 0.2, 1.8.4, 6.3.2, 12.5.5.6 
altitude of a spherical triangle 

18.11.5 

— of a triangle 9.4.1.3, 10.1.4, 

10.13.1 

— of a tetrahedron 10.6.6 
analysis 6.6, 11.8.11.12, 13.5.6, 

19.7.3 

analytic geometry 2.4.8, 10.7.6, 
14.1.4, 14.5.3, 17.1.6 
-spheres 10.7.7, 20.4-2 

— between tangent vectors to a 

sphere 18.1.2.4 

— , dihedral 12.1.12 

— , non-oriented 8.6 

— of a rotation 1.7.5, 9.S.4, 9.S.5 

— of an orientation-preserving 

similarity 9.6.3 
—s and similarities 8.8.5 
—s in Artinian plane 13.8.5 
—s in Euclidean affine space 9.2.1 
—s in hyperbolic space 19.2.7 
—s of an elliptic triangle 19.1.S.1 
—s of a spherical triangle 18.6.6 
—s of a triangle 10.1.2 

-, sum of 10.2.4 

animals 12.5.5.7, 11.86 
anisotropic IS.2.1, 13.9.4 
antir Euclidean 19.2.7 
antihoinographies 18.10.2.4 
Appolonius 17.1.5, 17.3 

— ’ formula. 9.7.6.1 

— ’ hyperbola 17.5.5.6 


— 'problem 10.11.1 

— ' theorem 15.6.4 
approximation lemmas 12.9.2 
Apulia, Italy 11.22 

arc cosine 8.3.11, 9.9.7.1, 18.6.6 

— of circle 8.7.5.4 
archimedean field 8.5.2 
Archimedes 15.7.6 
architecture, 11.22; see also shell 

architecture 14.4.6 
area 9.12.4.1, 9.12.7 

— and Steiner symmetrization 

12.10.10 

— of a compact convex set 12.10.2 

— of a triangle 9.7.3.8, 9.12.4.4, 

10.1.5 

— of ellipse 17.7.5 

— of pedal triangles 10.4.5 

— of poly topes 12.S 

— of the sphere 12.10.4.1 
—s , calculation of 9.12.6.3 
—s on the sphere 18.S.8.1 
argument 8.7.9 

arithmetic 6.6, 10.13.29, 12.4.6, 
13.7.2, 19.7.3 
arrow 3.1.2 
art l.ix, 1.7.1 
Artin 13.7.2 

Artinian plane IS.1.4-4, 13.8, 19.2.5 

— space IS. 1-4-4 

-, quadrics in 14.4 

associativity of barycenters of 

punctual masses 3.4.8 

— of the base field 2.5.5, 5.4.4 
asteroids 17.2.1.7 

astroid 9.14.S4.S, 17.7.4 
astronomy 11.146, 18.6.1, 18.6.11, 
18.6.13.8 

asymptote 15.5.11, 17.8 
asymptotic cone 15.5.11 
atlas 4-^-i 
automobile 18.6.11 
automorphic function 6.6.7 
axiomatic point of view 5.0.1 
axiomatization 2.6.7, 2.5.5 
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— of affine geometry 2.6.7 

— of Euclidean geometry 9.2.3 

— of projective geometry 4.5.12, 

4.6.7 

axis of an ellipse 16.3.10.2 

— of a rotation 8.4-7.1, 9.S.5 

— of a screw motion 9.S.5 

— of inertia 13.5.6 

B 

bailing-out maxim 11.1.2.2 
balls, volume of 9.12.4.6 
barycenter S. 4 .I, S.4-5, 9.7.6, 
9.12.6.1, 11.1.8.3 

— , calculation of 3.4.6.6 

— coordinates S.6.2, 10.6.8, 

10.13.26 

— subdivision S.6.5, 3.7.8 
base field, see associativity, 

commutativity, etc. 

— of a pencil 14-2.7.S 
Basel cathedral 9.6.9, 9.14.32 
basis of a complexification 7.1.3 
Beltrami 19.7.2 

Bernoulli, Jakob 9.6.9 
Bernoulli’s tomb 9.6.9, 9.14.32 
Betti numbers 4.3.9.4 
Bezout’s theorem 16.4 
bicircular quartics 9.6.8 
Bieberbach’s isodiametric inequality 
9.13.8 

Bieberbach’s theorem 1.7.7.3 
bijections 0.2 
bilinear form 8.1.1,11.86 
billiards 9 . 4 .I, 9.14.9, 17.9.2 
Binet’s formula 17.2.1.7 
binomial coefficients 0.2 
bisector 8.7.S.2, 8.7.7.4, 8.12.10, 
9.14.3, 18.11.5, 20 . 4 . 4 , 20.8.2 
bistochastic matrices 11.6.6.3, 

11.9.7 

bitangent conics 16.4.10.3 
Blaschke Kugelungsatz 9.13.6, 
9.14.24 


Blaschke rolling theorems 12.12.14 
Blaschke selection theorem 9.11.4 
Blaschke-Lebesgue 12.10.5 
Bolyai 19.7.2 

Bonnesen’s theorem 12.11.9.4, 
12.12.15 

Borsuk-Ulam, theorem of 18.2.7 
bound vector 2.1.S 
box 9 . 12 . 4.2 
Boy’s surface 4-S.9.1 
Brahmagupta’s formula 10.13.7 
braid lemma 9.5.4.9 
Brianchon’s theorem 16.2.13, 19.3.5 
British Museum 12.5.5.7 
Bruhat-Tits lemma 9.8.6.5 
Brunn-Minkowski theorem 
11.8.8.1, 12.11.3 

C 

calculation in affine spaces 2.3.8, 
9.2.6 

— in projective spaces 4.2.1 

— of areas 9.12.6.3,12.10.11 

— of duals 14.5.3 

— of barycenters 3.4.6.6 

— of cross-ratios 6.2 

— of cross product 8.11.10 

— of projective morph isms 4.5.13 

— of quadrics 14.1.4 

— with affine polynomials 3.3.3 

— with vectors 3.1 
canonical density 8.11.3 

— measure of affine spaces 2.7.4.2 
-of Euclidean affine spaces 

9.12 

-of elliptic spaces 19.1.1.7 

-of hyperbolic space 19.5.1 

- -of spheres 12.3.2, 18.3.7 

— norm 9.2.6.4 

— projection onto a projective space 

4 . 1.1 

— topology of an affine space 

2.7.1.1 

-of a projective space .^.5.1 
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— volume 9.12 

-form 8.11.S, I 8 .S .4 

car 18.6.11 

Caratheodory’s theorem 11.1.8.6, 
11.5.8, 11.7.4.2 

cardboard polyhedra 12.1.3.2, 12.8 

cardinality of a set 0.1 

cardioid 9.6.8, 9 .I 4 .S 4 .S, 9.14.34.5 

Caron, Joseph I.xi 

Cartan-Dieudonne theorem 13.7.12 

Carthage 12.11.8 

cartography 4.7.5, 18.1, 18.1.5.4 

— of FVance 18.1.8.5 

— of Switzerland 18.1.8.3 
cash prize 10.6 

Castel del Monte, Italy 11.22 
Castillon’s problem 10.11.4, 

16.3.10.3 

Cauchy’s formula 12.3.3, 12.8.1, 
12.10.2, 12.11.9.6 

Cauchy’s lemma 12.6.6.1, 18.6.11, 
18.7.16 

Cauchy’s theorem 12.8.1 
Cauchy-Schwarz inequality 

12.11.5.4 

caustic 9.14.34.3 

Cayley 5.2.2, 1.122, 8.8.7.5, 16.6.1, 

16.6.12.4 

Cayley numbers, see octonions 
Cayley-Menger determinant 
9.7.S.1, 9.7.4, 9.14.23 
Cayley’s formulas 17.6.7 
celestial charts 18.1.8.6 
cell decompositions 4.3.9.4 
center of a compact set 2.7.5.5 

— of an isometry 9.S.S 

— of a group 1.6.7.2 

— of a quadric 15.5.S 

— of a regular polygon I 2 . 4 .S 

— of a regular polytope 12.5.2.1 

— of a rotation 1.7.5 

— of a rotation 9.S.4 

— of a similarity 9.5.2 

— of a sphere 10.7.1 

— of curvature, see curvature 


— of mass 2.7.5.3, S. 4 .S, 3.4.6.1, 

12.5.2.1 

center of the ortogonal group 8.2.16 
central field S.1.2.2 

— quadric 15.5.S 
centrifugal “force” 17.5.5.5 
centroid 2.7.5.2, 3.4.2, 9.12.6, 

11.5.9.3, 11.7.6 

Ceva, theorem of 2.8.1, 3.7.12, 

10.13.1.2 

chain of theorems 10.9.8, 10.13.19 
change of variable theorem 0.6, 

2.7.4.2 

characteristic classes 14.3.7 
characteristic function 0.6 
chart 4 . 2 . 1 , 18.1.5.1, 18.2.1, 18.11.2 
chart, what you want in a 18.1.7 
charts in P(E) 4.5.13 
Chasles’s relation 2.1.5, 8 .7.2. 4 , 
10.9.7.2, 16.6.1 
Chasles circles 17.9.11 
Cheeger-Simons, conjecture by 
18.3.8.6 

Chinese lantern 9.12.7 
chords 10.9.2.1 

circle 8.3.3, 8.12.4, 9.4.4, 10.7.1, 
11.6.1, circle 18.1.S.1 

— as a conic 17.4.2 

—s in hyperbolic space 19,3.3.2, 

19.6.8.2 

—s in the plane 10.9 

— , classical problems on 10.11 
Circle Limit 11.342 

circular group 20.6.1 
circular sections 15.7.14 
circumscribed 12.5.2.1 
class formula 1.6.6, 1.8.3.1 
classical geometry 2.7, 1.200; see 
also Greeks 
classical groups 16.3.9 
classification 1.6.5 

— of affine quadrics 15.3 

— of crystallographic groups 1.9.9 

— of Euclidean affine quadrics 

15.6.1 
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— of quadratic forms IS.1.4-1, 13.4 

— of quadrics 14.1.5 
Clifford algebra 8.9.13, 8.10.3, 

13.6.8 

Clifford parallelism 4.3.7, 18.8.1.2, 

18.8.4, 18.9.3, 18.11.17, 19.1.4, 
-, generalizations of 18.9.4.4 

— parallelogram 18.8-4 
closed ball O.S 

— convex hull 11.2.S 

— hyperplane 2.7.S.2 
closedness of stabilizers 9.8.3 
cobordism 4.0.5 

cocoon ll.S.6.0 

cocube 12.1.2.5, 12.5.4.3; see also 
standard cocube 
cogwheel design 9.14.34.6 
cohomology rings 4.3.9.4 
combinations 1.5.2 
comets 17-2.1.7 
common perpendicular 9.2.6.5 
commutative algebra 16.4.9 
commutativity of base field 2.3.3.9, 
2.5.5, 4.5.6, 4.9.8 
compact convex set 9.12.7, 12.9 

— manifolds 4.0.5 

— subgroups of GA(Jf) 2.7.5 
compactness of orthogonal group 

8.2.3.3 

— of projective spaces 4.3.3 

— of sphere 18.2.1 

— of stabilizers 9.8.6 
compass 10.8.3, 10.9, 10.11.2 
complementary affine subspaces 

2 . 4.94 

complete quadrilateral S.7.18, 

6.4.4, 17.9.19 
completely faithful 5.1.3 

— singular IS.2.1 
completeness of hyperbolic space 

19.2.6 

complex afiiiie spaces 2.8.7 

— analysis 8.2.14 

— conjugation 2.6.1 

— exponential 8.3.7 


— numbers 0.2 

-and plane similarities 8.8.4, 

9.6.4 

— projective quadric C(4) 20.5.7 

— projective spaces 9.1.7, 19.1.5 

— proper quadrics 14.3.5 

— structure for Euclidean plane 

8.3.12 

complexification 5.0.5, 7, 7-4-1, 
18.6.13.11 

— , canonical 7.0.5 

— , natural 7.7.5 

— of an affine morphism 7.6.S 

— of an affine space 7.6.2 

— of a projective morphism 7.5-4 

— of a projective space 7.5.1 

— of a real vector space 7.1.1 

— of conics 17.4.1 

— of morphisms 7.2 

— of polynomials 7.3 

—s and similarities 8.8.6 
—s and subspaces 7.4 
computation, see calculation 
concave function 11 . 8.6 
concavity 9.14.18 
cone 11.1.6, 11.6.2, 14.5.S 
conformal chart 18.1.7.2 

— group 20.6.1 

— map 9.5.4.2, 18.10.S.1, 19.8.6 

— model 19.6.1 
congruent 10 . 2.6 

conic 1.2.7, 1.4.2, 13.3.2, I 4 .I.I, 
15.1.1 

— at infinity 15.1.S.2 
conjugate diameters 15.5.10 

— elements in S„ 1.6.4 

— of a quaternion 8.9.1 

— set 15.6.S 

— subgroup 1.5.3 
conjugation 2.6.1 

— , complex 7.1.1, 7.5.1, 7.6.2 

— by inversions 10.8.4 

— with respect to a quadric 14-5.1 
connectedness of 0'*'(E) 8.4.3 
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— of the complement of subspaces 

2.8.7 

— of the projective group 4.9.5 

— of the sphere 18.2.1 
conoid 14 . 1 . 7.1 

constant vector field S.1.2.1 

— width 12.10.5 
contact theory 16.4.11.3 
continuous screw motion 9.14.7 
convex compact sets 9.4.3 

— cone 11.1.6 

— curve ll.S.10.1 

— function 11.8,11.^.1,11.8.10, 

11.8.12, 11.9.15 

— hull 11.1.8.1 

— polar body, see polar body 

— polyhedron 12.1.1 

— sets 8.2.5.3, 11, 11.1.1, 11.4, 

12.9.1 

-, frontier of 11.6 

-, topology of 11.3 

— spherical polygon 18.7.1 

— surface 11.S.10.2 
convexity criteria 11.1.7, 11.8.11 

— inequalities 11.8.11.12 

— of quadrics 15.4.6 
coordinates, affine 2.2.9 
correlation 14.5.5, 14-8.12.1 
cosine 8.S.6 

cosine-haversine formula 18.6.11 
Cosiuus 18.11.24 
cowlick 18.2.5.4 
Cramer’s rule 6.2.1 
crank 10.8.3 

cross product 8.11, 8.11.8, 8.12.9, 
15.6.6 

cross-ratio 6.1.2, 6.8.IS, 9.6.5, 
10.13.17, 14 . 4 . 4 , 16.2.5, 

16.2.7.S 

—s , action of permutations on 6.3 
—s and duality 6.5 
—s and field structure 6.4.8 
—s and projective coordinates 
6.5.10 

—s , calculation of 6.2 


crystal 12.5.5.7 

crystallographic groups 1.7, 1.7.S, 
1.9.9 

cube 1.8.3.4, 12.1.2.5, 12.5.4.2; see 
also standard cube 
cubic 16.6.1 

cultural constructs 12.5.5.7 
cup product 11.86 
curvature 9.10.7, 12.10.9.1, 17.7.4, 
17.9.9, 18.3.8.6 

— , lines of 9.5.4.21 
curve 2.8.12, 9.9.1 

—s of constant width 12.10.5 
cycles of a permutation 1.6.4 
cyclic group 0.2 

— permutation 10.4.5.1 

— points 9.5.5.1, 10.13.1.3, 17.4.1, 

17.4.3, 17.5 
eyelid 20.7.2 

cylindrical wedge 12.12.20.5 
D 

Dandelin 17.3 
Darboux, theorem of 20.8.7 
da Vinci, Leonardo 1.26, 11.3 
De Divina Proportione 1.26, 11.3 
deficit 12.11.9.S 

degenerate quadratic form IS.2.1 

— conic 16.4.10 

— quadric I 4 .I.I, 14-2.7.5, 14.5.6 

— simplex 14.5.4-1 
degree 18.2.5.2 
derivative 2.7.7 
Desargues 4.0.2, 5.0.1 
Desargues’s theorem 2.5.4, 2.6.7, 

4.8.3, 5.4.3, 5.4.7, 6.7.4, 

14.2.8.3, 16.5.4 
Descartes 17.1.6 

— oval 9.14-22 

— ’s law 9.14.22 

— ’s priuciple 17.1 
determinant of a form 11-8-9-2 
diagonalization, see 

orthogonalization 13.9.6 
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diameter of a set O.S 

— of a quadric 15.5.1 

— and Steiner symmetrization 

9.13.4 

dictionary 14 
Dido 12.11.7 
Dieudonne 13 
difference set 0.1 

differentiability of convex functions 
11.9.16 

differentiable curve 2.8.12 

— hypersurface 11.9.23 

— manifold 1.7.7.4, 4.2.6, 4.3.2, 

9.10.7, 9.12.2, 9.12.7, 9.9.6, 
10.7.4, 11.3.10.1, 11.9.23, 

12.10.9.1, 14.8.5, 18.10.3.1, 

18.3.1, 19.1.1.4 

— —s of constant negative 

curvature 12.6.8 

— maps, singularities of 12.6.8 

— calculus 2.7.7 

— equation 6.6.7, 6.8.12 

— forms 18.3.4 

— geometry 4.0.5, 6.6, 9.10, 9.5.4.1, 

9.5.4.9, 10.8.5, 13, 16.4.12, 
18.10.3, 19.6.12, 19.7.3 

— operators 13.5.6 

difficulty in measuring angles 8.3.13 
dihedral angle 12.1.12 

— group 0.2, I. 8 .S. 4 , 1.8.7, 12.4.5 
dilatation 2.S.S.12, 6.6.2 

—s , characterization of 2.5.6, 5.2.1 
dimension of a convex set 11.2.6 

— of an affine space 2.1.8 

— of an affine subspace 2 . 4 .I 

— of a projective space 4-i-l 

— oiA(X-,X') 2.3.4, 2.3.9 
Dinghas-Hadwiger-Bonnesen 

inequality 12.11.9.3 
diopters 9.14.22 
direct sum 0.2 
direction of a line 19.2.10 

— of an affine subspace 2 . 4 .I 
directrix 17.2.1.4, 11-9.1 
discrete action 1.7.5.1 


discriminant IS.I. 4.6 
distance 0.3 

— function 11.8.12.2 

—s in Euclidean affine spaces 9.2.2, 
9.2.5, 9.7, 9.10 
distinguished point 3.1 
division algebras 12.6.8 

— by zero, convention about 6.2.2 
D. K. Kazarinoff 10.4.6 
dodecahedron 1.8.3.4, 1.8.4, 12.5.5 
domestic animals 10 
dominated convergence theorem 0.6 
doors 8.1.8.5 

double pedal 9.6.8 
double point 4.5.18, 14.1.5.S 
Dowker’s theorems 12.12.5 
dual, algebraic 0.2 

— polytope 12.6.2.4 

duality 6.5, 8.1.8, 11.1.5, 12.1.2.6, 
16.5.6 

— with respect to a quadric 14.5 
Dupin eyelid 18.11.19, 20.1.S 
Dupin’s theorem 9.5.4.21 
Dvoretsky 11.7.5 

E 

Eadem mutata resurgo 9.6.9 
Earth 4.7.5, 18.1.3.2, 18.2.5.4 
eccentric spheres 10.1.5 
eccentricity 17.2.1.4, 17.9.23 
Escher, M. C. 4.7.6, 1.7.4, 1.7.6, 
1.7.7.6, 19.6.12 
edge 10.6.1 

— of a polyhedron 12.1.8 

— of a triangle 2 . 4 .I 
Egyptian tablet 10.6 
Eiffel Tower 11.146 
eigenvectors of quadratic forms 

13.5.1 

elation 16. 4 .IS 
electricity 9.14.34.6 
elegance 7.0.3 
element of volume 18.3.7.1 
Elements 2.6.7 
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elementary geometry 4 

— volume 12.2.6 
eleven-point conic 16.5.5.1 
£lie Cartan 8.10.3 

ellipse 1.4.2, 9.4.3, 12.12.12, 
15.S.S.2, 17.7 

ellipsoid 12.12.2, 15.S.S.S, 15.6.S, 
15.7.9, 

— of revolution 18.1.5.3 

— of inertia 13.5.6 

— , volume of 11.8.9.1 
elliptic equilateral sets 19.8.24 

— function 16.6.1 

— geometry 19.1 

— integral 17.7.5 

— paraboloid 15.S.S.S 

— space 9.7.4, 19.1.1.1 

— transformation 6.8.8 
endomorphism 2.4.9.7 
endpoint of a half-line 19.1.1.5 

— of a curve 9.9.1 

— of a segment S.4.S 
envelope 9.6.7, 11.8.12.5, 17.8.3.2 
envelope equation 14.6.1, 14.8.13 
epicycloid 9.14.84-1, 12.12.20.1 
epigraph 11.8.8 

equality of triangles 10.2.6, 
18.6.13.10, 18.11.8, 19.8.4 
equation of a quadric I 4 .I.I, 15.1.1 
—s of the center 15.2.4 
equator 18.1.2.8 
equiaffine curvature 2.8.12 

— geometry 2.7.6 

— length 2.8.12 
equibarycenter 8 . 4 .S 
equidistant hyperplane 9.7.5.1, 

19 . 4.2 

— loci 18.4.6, 19.1.2.3 
equilateral triangle 10.1.8 18.6.18.9 

— hyperbola 17.1.8, 17.5.1, 17.8.3, 

17.9.16 

— set 19 . 8.24 

equivalence of quadratic forms 
13.1.4 

equivalent chart 18.1.7.4 


Erdos-Mordell, theorem of 10.4.6 
ergodic theory 9.4.4, 19.7.3 
Erlangen program 1 
essential singularity 9.5.4.8 
Etruscans 12.5.5.7 
Euclidean affine plane, see 
Euclidean plane 

-quadrics 15.6 

-space see Euclidean space 

— geometry 1.8.6, 8.1.8.5 

— plane 3.4.2 

-, complex structure for 8.3.12 

-, tilings of 1.7.4, 12.6.10.4 

— space 2.7.5.8, 3.7.8, 7.0.1,9.1.1 

— terminology 8.4.7.3 

— vector space 1.2.5, 8.1.1 
Euclid’s axioms 2.4.4 

— definitions 2.4.9.1 

— fifth postulate 2.4.9.5, 11.319, 

19.1.1.6 

— Elements 2.6.7 
Euler angles 8.9.5 

— characteristic 12.7.5.4, 12.10.9.2 

— equation 11.8.12.5, 12.12.15 

— identity 8.7.12 

— formula 12.7.3 
everyday experience 2.7 
evolute 9.10.8, 9.14.21, 9.14.34.4, 

17.7.4 

exact homotopy sequence 8.10.3 
excellent space 9.9. 4 .4 
exponential map 8.3.7 
exposed point 11. 6 .4 
extended real line 5.2.3 
exterior algebra 4.3.3.1, 8.11.1; see 
also alternating bilinear form 
exterior bisector 9.4.1, 10.1.4 
externally tangent 10.7.5 
extremal point 11. 6 . 4 , 11.9.8, 12.1.9 
eyesight 4.0.4, 4.7.3 

F 

face of a polyhedron 12.1.5 

— of a triangle 2-4-7 
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faithful action l.S.l 

Fedala, Spain 11.132 

Fedorov 1.7.7.1 

Fermat 9.10.6 

Fermat numbers 12.4-6 

Fermat’s problem 10.4.3 

Feuerbach theorem 10.11.3 

fiber 18.1.S.6 

fiber bundle 8.10.3 

field automorphism 2.6.1, 14.8.12.2 

field of prime characteristic 3.7.1 

— structure and cross-ratios 6.4.8 

— with four elements 6.3.2 

— with seven elements 16.8.17 

— with three elements 16.8.16 
—s , convention about 0.2 
finite fields 4.1.3.7, 4.6.16, 4.9.11, 

12.6.8, 13.1.4.5 

— geometry 15.4.8 

— subgroups of GA(Ar) 3.7.3 

-of Is(S'*) 18.5.10 

finite-dimensional affine spaces 2.7 
finiteness of stabilizers 9.8.5 

first fundamental theorem of 
projective geometry 4.5.10 

— variation formula 9.4.1.2, 9.10.1, 

9.10.7 

fixed points of homographies 6.6.1 

flag 12.5.1 

flat angle 8.7.S.1 

— torus 18.11.17 
flexible 12.8.2 
focal axis 17.2.1.4 

— circle 17.9.1 

— point 6.6.4 

focus 17.2.1.4, 17.4.3, 18.11.15 
foci, determination of 17.9.24 
Ford circle 10.13.29 
forms and densities 9.12.2 
formula of the median 9.7.6.5, 
9.8.6.4 

— of the three levels 12.12.20.7 
formulas for hyperbolic space 19.6.9 

— for spherical triangles 18.6.13 

— for triangles 10.3 


foundry 11.152 

Fourier series 12.11.9.4, 12.12.12 
four-sphere 4.8.2, 18.1.3.4 
fractal 9.9.3.3 
frame, see affine frame 
free fall 17.5.5.5 

— vector 2.1.S 

Firegier point 16.S.6, 16.8.7 

— ’s theorem 16.3.6 
frontier of a convex set 11.6 
frustum 10.6.8.1 

Fubini’s theorem 0.6, 9.12.3, 11.3.2, 
12.11.4 

Fubini-Study metric 19.8.22 
Fuchsian group 19.6.12 
full pencil 16.5.1 
functional analysis 11.4.2 
functorial correspondence 3.2.2 
functoriality 5.1.3 

— of complexification 7.2 
fundamental domains 1.9.12 

— formula of hyperbolic 

trigonometry 19.3 

— formula of spherical trigonometry 

18.6.8 

— group 8.10.3, 12.7.5.3, 18.2.1 

— quadratic form 20.2.1 

— quadric 20.2.4 

— theorem of affine geometry 6.4.8, 

2.6.3 

— theorem of projective geometry, 

first 4.5.10 

— theorem of projective geometry, 

second 5.4.8 

— tiling 1 . 7.4 

G 

GA(X), compact subgroups of 2.7.5 

— , finite subgroups of 3.7.3 
G-action 1.1.1 

Galois theory 12.6.10.3 
game theory 11.8.10.10 
gauge 11.8.12.1 
Gauss 12.4.6, 18.1.5.3 
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— approximation 6.6.5 
Gauss-Bonnet formula 18.3.8.6 

— theorem 12.7.5.2 
Gauss projection 18.1.8.S 

— ’s method 13.4.8 
general topology 0.4 
generalized circle 17.4.2.1 

— sphere 20.1, 20.5.1 
generating lines IS.7.10, 14-4-1 
generators for 0(E) and 0+ 8.4 
genus 12.7.5.4 

geodesic 9.9.6 

geodetic reference spheroid 18.1.5.3 
geoid 18.1.5.S, 18.1.6.3 
geometric algebra 8.5, 13.3.5 

— arcs 9.9.2 

— constructions 6.2.7, 6.4.6, 9.3.6, 

9.6.2, 12.4.6 

— realization of hyperplane 

reflections 14.7.4 
geometrical optics 6.6.5, 6.8.4 
geometry 1.4, 13.7.2 

— of numbers 11.9.16 
Gerardin 14.8.16 
Gergonne 10.11.1 
Germany 11.133, 11.152 
ghost edges 12.8.6.4 
Girard’s formula 18.3.8.4 
G\j(E), compact subgroups of 

11.8.10.8 

good conic 16.5.4 

— parametrization 16.2.9, 16.7.2 
GP(£7), involution in 4.5.19, 6.4.7 

— , topology of 4.5.20 

Gram determinant 8.11.5, 9.2.6.2, 
9.7.3.3, 15.6.3, 18.11.13, 
18.3.7.1 

Gram-Schmidt orthonormalization 
8.1.4 

grassmannian 1.2.5, 1.5.9, 1.9.10, 
8.2.7, 8.6.1, 14.3.7, 14.8.4 

— coordinates 2.4.8.8 
Graves 17.6.4 

great circle 18.1.2.1 


Greeks 2.7, 1.152, 1.200, 12.5.5.7, 
11.170 

Greenwich meridian 18.1.6.5, 

18.1.8.3 

Gromov 12.11.4 
ground field, see base field 
group action 1.1.1 

-, discrete 1.7.5.1 

-, faithful 1.3 

-, p-transitive 1.4.5 

-s associated with a great circle 

18.8.3 

-, section of a 1.6.6 

-, simply transitive 1.4.3 

- j transitive 1.4 

— of a proper quadric 14.7 

— of a quadratic form 13.6 

— of permutations 0.2 

— presentations 1.8.7 

— structure on spheres 8.9.1 
Guldin, theorems of 12.12.20.9 

H 

Haags Genieentemuseum, The 
Hague 1.7.4, 1.7.6, 4.7.6, 
19.6.12 

Haar measure 8.2.5.1 
Hadamard 7.0.1 
Hahn-Banach theorem 11.4.1 
hairy ball 12.11.5, 18.2.5.4 
half-line 8.6.1 

—s in elliptic space 19.1.1.5 
—s in hyperbolic space 19.2.10 
half-perimeter 10.S 
half-space 2.7.3 

— model 19.7.1 

halving angles 8.7.3, 8.7.7.7 
harmonic conjugate 6.4.5 

— division 6.4.1, 10.7.10.2, 16.2.5, 

19.4.2 

— oscillator 13.5.6 

— quadrilateral 9.6.5.2, 9.14.15 
harmonically circumscribed 

14.5.4.4, 14.8.6, 17.9.5 
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— inscribed I 4 . 8 .IO 
Hart 16.6.1 

HausdorfF distance 9.11.1, 9.12.5, 
12.9.1 

— metric 9.11 

Hausdorifness of projective spaces 
4.3.3 

haversine 18.6.11 
Heaven and Hell 11.342 
helix 10.12.S, 10.13.22, 18.9.3 
Helly’s theorem 11.7.4.2, 11.7.7, 
11.9.18 

hemisphere 18.7.1 
heredity 10.8.1.1, 14.1.3.3, 14.5.2.4 
Hermitian space 13.5.7 
Heron’s formula 10.3.3, 10.13.7 
Hersch’s theorem 18.10.9 
heuristic remark 4.3.8, 12.6.8 
hexagonal web 5.5.8, 5.5.9 
Hilbert 19.7.2 

— spaces 11.86 

— geometry 11.9.4 

— spaces 13.5.6 
Hilbert-Dehn problem 12.2.7 
historical models 19.7.2 

— remarks 1.7.7.1, 4.0, 10.9.2.1, 

13.3.2, 11.116, 11.170, 11.318, 
19.1.1.6 

— remarks, see also Greeks 
Holder inequalities 11.8.11.8, 

12.11.5.4 
hole 12 . 7 . 5.4 

holomorphic function 9.5.4.S, 
18.1.3.2 

hoinofocal conics 17.6.S.1 

— quadrics 15.7.17 
homogeneity of affine spaces 2.2.3 
homogeneous coordinates 14.1.4.1, 

6.2.1, 7.5.5, 4 . 2 .S, 4.4.3, 4.5.13 

— of degree k 3.3.2.1, 3.7.12 

— piece of wire 2.7.5.6 

— plate 2.7.5.3, 3.4.2 

— polynomial S.S.l 

— space 1.5.4 
homogenizing variable 3.3.13 


homographies, fixed points of 6.6.1 

— of a conic 16.S.1 

— of a projective line 6.6, 6.8.15 
homography 4-5.8, 4.5.16, 4.7.3, 

6.4.11, 16.2.3, 16.4.13 

— axis 16.S.S 

— , invariants of 6.1.7 
homokinetic joint 18.11.16,18.11.16 
homology groups 18.2.5.1 
homomorphisms, set of 0.2 
homothety 6.6.2, 8.8.3, 2.S.S.9, 

3.1.2.2 

homotopic bases 2.7.2.7 
Hooke joint 18.11.16,18.11.16 
Hopf fibration 1.2.9, 1.6.4, 4-5.7, 
18.1.3.6, 18.9.4.2, 18.11.30 
horocycle 19.6.8.S 
horosphere 19.8.15 
horror cycle 19.6.8.3 
hyperbola 13.8.4, 15.S.S.2, 17.8 
hyperbolas, specific properties of 
17.8 

hyperbolic cosine 13.8.9, 0.5 

— distance 13.8.7 

— geometry 13.8.4, 1.8.6, 18.3.8.6 

— paraboloid 15.S.S.S, 15.3.3.3, 

15.3.3.3 

— plane 13.8.4, 13.1.4.4 

— regular polyhedron 19.8.26 

— sine 0.5 

— space 9.1.7, 9.7.4, 19 

— tangent 0.5 

— transformation 6.8.8 

— tiling 19.6.12 

— triangle 19.8.25 

— trigonometry 19.3 
hyperboloid 15.S.S.S 
hyperplane 2.4.2.1, 2.7.3 

— at infinity 5.1.S 

— reflection 9.2.4, 15.6.6.1, 14.7.4, 

19 . 4.2 

—s as spheres 20.3.5 
—s in hyperbolic space 19.2.10 
hypocycloid 9 .I 4 .S 4 .I, 10.4.5.5, 
10.11.3, 12.12.20.1, 17.8.3.2 
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I 

icosahedron 1.8.4, 8.12.12, 12.5.5 

— group 12.5.5.6 
identity map 0.1 

— matrix 0.2 

image focal point 6.6.4 

— measure 0.6 

— of a quadric I 4 .I.S. 8 , 15.1.1, 

15.1.S.2 

—s , line of the 10.13.16 
imaginary radius 20.1.8 
improper motions 9.1.4 
incommensurable rotations 8.4.7.3 
independent set 2 . 4 .S 
index IS.7.5, I 4 .I.I 
inequalities 10.4, 12.11.9 
infinitesimal angles 9.5.4.1 
infinity 4.5.15 

— , conic at 15.1.S.2 

— , hyperplane at 5.1.S 

— , point at 4.0.2, 5.2.S, 5.3, 

10.7.11, 15.1.S.2, 20.1.8 

— , quadric at 15.1.3.2 

— , sending objects to 5.4 
inner automorphism 1.2.6, 1.3.1, 

1.4.2, 1.5.2, 1.6.7.2, 2.7.5.11, 
6.8.13, 8.10.2 
inscribed polygon 9.4.2 

— triangle 9.4.1, 10.1.5 
Institut Geographique National 

18.1.8.5 
integers 0.2 

integral geometry 12.11.9.5 
integration theory 18.3.1, 0.6 
interior 19.1.3.3 

— bisector 10.1.4 

— radius 11.9.12 
internally tangent 10.7.5 
international date line 18.1.6.1 
intersection of affine subspaces 

2.4.9 

— of projective subspaces 4.6.10 

— of subspaces 4.0.1 

— of two conics 16.4 


— of two spheres 10.7.5, 20.4 
intrinsic metric 9.9.7, 9.9.8, 9.14.30, 

I 8 . 4.2 

— space 9. 9 . 4 .4 
invariance of domains 9.5.4.2, 

18.2.6.5 

invariants of homography 6.1.7 
inversion 9.5.3.6, 9.5.4.4, 9.5.4.18, 
9.14.15, 10.8, 18.9.1, I 8 .IO.I. 4 , 
19.6.11 

— sphere 10.8.1.1 

—s and differential geometry 10.8.5 

— and osculating circles 10.8.5.5 

— of a sphere 18.10.3.2 

— preserve angles 10.8.5.2 
involution 6.7.1 

—s of a conic 16.3.1 
—sofGP(£:) 4.5.19,6.4.7 
—sofO(g) 13.6.6 
involutive 14 . 8 . 12.2 

— correlations 14,5.5 
irreducible linear map 8.2.6 
IslS"^), finite subgroups of 18.5.10 
Is(Jf), generators of 9.3 

Islam 1.7.1 

isogonal circles 10.13.20 

— spheres 20.4.4 
isolated point 1.7.5.1 
isometric chart 18.1.7.1, 18.11.2 

— charts, non-existence of 18.4.4 

— spaces 13.1.4-1 

isometries of Artinian planes 13.8 

— of elliptic spaces 19.1.2.2 

— of Euclidean affine spaces 9.1, 

9.3 

— of hyperbolic space 19.4 

— of the sphere 18.5 
isometry 8.1.5, 0.3 

— group 1.2.8 
isomorphic spaces 13.1.4.1 
isomorphism between affine spaces 

2.2.6 

— , set of 0.2 

isoperiinetric inequality 10.4.1, 
10.5.1, 12.12.16, 12.11 
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isosceles lO.l.S, 10.2.2 
isotropic IS.2.1 

— cone 8.8, 8.8.6.1, IS.2.1 

— lines 8.8, 8.8.6.1, 1S.2.S.1, 

14.47 

isotropy group 9.8.1, 1.5.1 
Italy 11.22 

iterates of a homography 6.8.5 
J 

Jacobi identity 8.12.9 
Jacobian 4.9.4 

Joachimstal’s theorem 17.9.10 
Jordan curve theorem 18.2.6.3 

— decomposition 4.5.16 
Jordan-Brouwer separation theorem 

18.2.6 

Jung’s theorem 11.5.8, 11.9.18 
K 

Kakutani’s lemma 11.9.19 
/c-face of a polyhedron 12 .1.8 
Kirchberger’s theorem 11.9.10 
Klein, Felix 1.4 

— bottle 1.7.7.4 

— group 0.2, 6.3.2, 

— model 19.2.5 
Krasnosel’skii’s theorem 11.7.7 
Krein-Milman theorem 11.6.8 
Kritikos theorem 11.9.23 
Kugelungsatz 12.11.2, 12.11.5.3 

L 

Laguerre 20.6.4 

— ’s circles 10.11.6 

— formula 8.8.7.2, 17.4.3.1 
Lahire’s cogwheel 9 .I 4 .S 4 .S 

— theorem 17.9.14 
Lambert projection 18.1.8.5, 

18.11.27 

Latin quotation 9.6.9 
latitude 18.1.6.1 
latitude-longitude chart 18.1.6.1, 
18.3.4 


lattice 1.7.5.2, 9.14.29 
least-perimeter polygon 9.14.10 

— triangle 10.4.4 
Lebesgue, Henri I.xi 

— measure 0.6, 2.7.4.S, 8.2.5.2, 

9.12.1, 11.8.8.1, 12.2.5 
Lefschetz fixed point theorem 

18.2.5.7 

left translation 1.2.6, 1.4.4.2 
length 

length of a compact convex set 

12.10.2 

— of a curve 9.9.1, 9.12.4.1, 

12.11.6.1, 19.8.18 

— of hypo- or epicycloid 9.14.34.4 

— of ellipse 17.7.5 
lens 6.8.4, 18.1.1 
lever 18.11.1 

Lexell’s theorem 18.11.10 
Lhuillier, Simon 18.6.13.8 

— 's inequality 12.12.16 
Lie algebra 8.12.9 

— group 1.8.7.3, 2.7.5.12, 8.10.1, 

12.6.8 

light cone IS.2.1 

— polygon 9 . 4 . 1 , 9 . 4.2 

limit points of a pencil of circles 

10.10.1 

Lindelof 12.11.9.2 
line, affine 2 . 1 . 8 , 2 . 4 . 2.1 

— in elliptic space 19.1.1.5 

— in hyperbolic space 19.2.10 

— of the images 17.6.2.2, 10.13.16 

— of position 18.6.11 

— , projective 

linear algebra 4.6.8, 11.7.4.1, 2.4.8, 
4.0.5 

— equations, system of 2.2.3 

— group 0.2, 1.2.4 

— maps 0.2 

— programming 11.8.10.10 

link of a regular polytope 12.5.S.2 
linkedness 14.3.4, 10.12.2, 20.5.5 
Liouville 9.5.4.21 

— form 11.87 
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— ’s theorem 9.5.4, 10.8.5.4 
Lipschitz map 9.11.5, 9.11.6 
little circle 18.1.2.2 

— sphere 18.1.2.2 
live edges 12.8.6.4 

— face 12.8.6.4 

— vertices 12.8.6.4 
living creature 12.5.5.7 
Lobatchevsky 19.7.2 
local convexity 11.1.7.4 
locally compact field 4.3.5 
I’Oeuvre Notre-Dame museum 

(Strasbourg) 10.12.1 
Loewner-Behrend function 11.8.9 

— theorem 11.8.10.7 
logarithmic spiral 9.6.9.1, 9.14.21, 

9.14.32, 17.9.16, 18.11.3 
Lohr, Germany 14.4.6, 15.3.3.3 
longitude 18.1.6.1 
Lorentz group IS.6.1, 19.7.3 

— metrics 13 

lower Minkowski area 12.10.9.S 
loxodrome 18.1.8.2, 18.3.3, 18.11.3 
loxodromes 18.3.3 
!>,,-spaces 11.8.11.12 
Lucas theorem 11.9.21 
Liiroth 16.2.10 

M 

major axis 17.2.1.4 
Malfatti’s problem 10.11.5 
manifold, see differentiable 
manifold 

map, identity 0.1 

— restriction of a 0.1 
Marchaud 15.4.8 
mathematical zoo 11.86 
matrix associated to a quadratic 

form 13.1.3.6 

— , identity 0.2 

— , transpose of a 0.2 
maximal width 11.9.12 
maximum perimeter polygons 

17.6.5, 17.9.7 


measure of an angle IS.8.5, 8.S.9, 

8.4.3, 8.7.7.6 

— of an oriented angle 8.7.4.1 

— theory 11.6.6.2 

— zero 12.3.3.1, 9.4.4, 11.3.2 
measuring angles, difficulty in 

8.3.13 

mechanical devices 12.10.5 

— generation for ellipses 17.7.1 

— linkage 4.9.12, 10.8.3 
mechanics 2.7.5.3, 3.4.2, 3.4.6.1, 

8.9.5, 9.14.34.6, 11.87, 17.5.5.5, 
11.255, 18.6.1 
median 3.4.10, 10.1.4 

— of a tetrahedron 10.6.3 

— of a spherical triangles 18.11.5 
Menelaus, theorem of 2.8.2, 3.7.17 
Menger 9.7.4 

— curvature 9.14.31 
Mercator projection 18.1.8.2, 

18.10.3.2 

meridian 10.12.1, 18.1.6.2, 

— of a torus 18.8.6, 18.9.3 
meromorphic functions 9.5.4.8, 

18.1.3.2 

metric spaces 0.3 

metrical properties of conics 17.2, 

17.3, 17.4 

-of elliptic space 19.1.2 

Mexico 15.3.3.3 

midpoint S. 4 . 2 , 3.4.10, 6.4.2, 19.4.3 
Miguel’s six circle theorem. 10.9.7.2 
minimal width 11.5.9.5, 11.9.12 
minimum, problems of 10.4 
Minkowski 12.3.4, 11.9.16 

— area 12.10.9.S 

— geometry 15.5.10 

— inequality 11.8.11.10 

— sum 12.1.17, 11.1.3, 11.9.1, 

11.9.14 
mirror 6.8.4 
mixed product 8.11.S 
Mbbius group I8.IO.I.4, 18.10.2, 
18.11.21, 19.2.2, 19.4.6.4 

— invariant 18.10.8 
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— tetrahedra 4.9.12, 5.5.3, 14.5.5, 

10.6.7 

Mohr-Mascheroni, theorem of 

10.11.2 

moment 14.8.12.5 

— of inertia 9.12.6.3 
moral of the story 6.2.6 
Morley 9.14.38 

— ’s theorem 9.14.34.5, 10.3.10, 

10.13.4 

morphism, affine 2.3 
Mostow’s rigidity theorem 18.10.9 
motions 9.1.4 
Motzkin theorem 11.1.7.2 
movie projectors 12.10.5 
MTU 18.1.5.3 
multilinear map 7.3.1 
multiplicity of an intersection 

16.4.9 

multivalued algebraic 

correspondence 16.6.1 
museum I.ix, 11.7.6 
mystic hexagon 16.2.12 

N 

Nagel’s point 10.13.30 
Napier’s analogies 18.6.13.8 
Napoleon-Mascheroni, problem of 
10.11.2 

Napoleon’s theorem 9.14.44 
naturality 7.0.3 
navigation 18.6.1, 18.6.11 
nephroid 9.I4.S4.S 
neutral quadratic form IS.1.4-8 
Nevanlinna 9.5.4.7 
New Zealand 18.1.8.6 
Newton’s laws 17.2.1.7 
nine-point circle 17.5.4, 10.11.3 

— conic 16.7.5 

non-associativity of barycenters 

3.4.9 

non-degenerate form IS.2.1 
non-degenerate quadric 10.7.11 
non-Euclidean geometry 1.8.6 


non-negative integers 0.2 

— reals 0.2 

non-oriented angles 8.6.S, 13.8.7 
non-periodic hyperbolic tilings 
19.6.12 

— tiling 1.7.2, 1.9.16 
non-rectifiable curve 9.9.3.3, 12.12.9 
non-regular tiling 1.7.2 
non-singular IS.2.1 
non-singular completion IS.S.4.I 
norm 8 .1.1, 11.8.12.1 

— , canonical 9.2.6.4 

— of a quaternion 8.9.1 
normal 17.1.4 

— cone 11.6.2 

— endomorphism 8.12.5 

— to a conic 17.5.5.6 

— to a parabola 17.9.18.2 

— to a quadric 15.7.15 
normalization 8.1.4 
north pole 18.1.2.S 
notion of a geometry 1.4 
NuUstellensatz 14.1.6, 15.7.1 
number of holes 12.7.5.4 

O 

0{E), center of 8.2.16 

— compactness of 8.2.3.3 

O'*'(E), connectedness of 8.4.3 
0(q), involutions of 13.6.6 
0[E), simplicity of 8.5 
0-*^(3) 1.8.1, 4.3.9.2 
object focal point 6.6.4 
oblique Mercator projection 
18.1.8.S 
obtuse lO.l.S 
octahedron 1.8.4, 12.5 
octonions 2.6.7, 4-S.S, 8.9.1, 
18.1.3.5, 

octonionic projective plane 9.1.7 
one-dimensional differentiable 
manifolds 11.3.10.1 
one-sheet hyperboloid 15.S.S.S, 
11.132 
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one-sphere 4.3.6 
open ball O.S 

— hyperplane 2.1.3.2 
opposite half-line 8.6.1 
optics 6.6.5, 9.14.34.6 
orbit 1.6.1 

order of a point 11.6.1 

— of a root 16.4.2 
orientation-preserving map 2.1.2.5, 

9.5.1, 9.5.4.2, 8.8.2 
orientation-reversing, see 

oriental ion-preserving 
oriented angle 8.1.2.3, 8.1.1.2, 

9.2.1 

— sphere 20.6.4 

— triangle 10.4.5.1 

origin as a distinguished point 2.2.3 
orthocenter 10.2.5 
orthochronous rotation 13.8.3 
orthogonal affine subspaces 2.4.8.1 

— direct sum 8 .1.8.4 

— group 8.2.1, 12.6.8, 13.6.1; see 

also 0( ) 

— groups, algebraic topology of 

8.10 

— groups, simplicity of 13.6.8 

— projection 9.2.4, 9-12.4.9 

— projective group 14-1.2 

— reflection 8.2.9 

— spheres 10.1.1 

— subspaces 9.2, 13.3.1 
orthogonality 9.2.1 

— in Euclidean spaces 8.1.1 
orthogonalization 13.4 

— , simultaneous 13.5 
orthonormal 8.1.1, 9.1.1 
orthonormalization 8.1.4 
orthoptic circle 16.2.7.1, 17.4.2.3, 

17.6.1, 17.9.5 

— sphere 15.1.13 

osculating circle 10.8.5.5, 11.5.5.4 

— conics I6.4.5, 16.4.10.4, 16.4.13 

— curves 16.4.12 

outer automorphism 8.10.2 
outside 18.2.6.2 


overcrowding 19.8.21 
P 

p-group 1.6.7, 1.6.1.1 
p-transitive action 1.4.5 
Pacioli, Pra Luca 1.26, 11.3 
painting 12.3.8, 9.12.7 
paper band 8.10.3 
Pappus’s theorem 2.5.3, 2.8.9, 

5.4.1, 5.4.2, 16.2.12, 16.8.19 
parabola 1.4.2, 9.6.7.2, 10.13.18, 
15.3.3.2, 15.7.6, 17.5.5.5, 
17.9.18 

— tangent to four lines 16.7.4 
—s and the Simson line 17.4.3.5 
paraboloid 15.3.3.3, 15.5.3 
parallel affine subspaces 2.4-9.1 
—s of latitude 10.12.1, 18.1.6.2 

-on the torus 18.9.3 

parallelepipeds 9.7.3.1, 12.1.2.1 
parallelism 4.6.13, 5.3 
parallelogram rule 8.7.2.5, 2.5.2, 

2.6.6.4 

paratactic annulus 10.12.3 
parataxy 10.12, 18.9 
Pascal, Blaise 9.6.8 

— limagon 9.6.8,9.14.17,9.14.22 

— line 16.8.3 

— ’s theorem 16.2.11, 16.8.4, 

16.8.5, 2.8.2 
path 9.9.1 

path-connectedness of U 8.3.8 

— of circle 8,12.4 

— of projective spaces 4.3.3 

— of sphere 18.2.1 
Peano curve 18.2.4.1 
pedal curve 9.6.8, 17.2.2.6 

— triangle 10.4.5 

pencil of circles 10.10, 10.10.1 

— of conics 6.7.4, 16.5, 16.8.15, 

11.5.1 

— of hyperplanes 6.5.1 

— of lines 6.5.1 

— of planes 6.5.1 
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— of quadrics 14-2.6 

— of spheres 20.5 
pendulum 17.5.5.5 
pentaspheric coordinates 20.7.1 
perimeter 9.4-1, 9.4-2, 12.S.1 
permutation 1.5.2, 1.6-4 

— group 0.2 

— matrices 11.6.6.3, 11.9.7 
perpendicular bisector 9.7.5.1, 

18.11.5 

Perpetual Motion 1.7.4 
perspective 4.7, 4-7.S 
Petersen 9.14.38 
physical universe 8.8.1, 8.11.7, 
1.200, 12.10.11.2 
physics 8.9.5, 11.146 
pitch 9.S.7 
pivot 10. IS. 18 
plane, affine 2.1.8, 2.4.2.1 

— curves 8.2.14, 12.11.9.4 

— diopter 6.8.4 

— isometries 9.3.4 

— lattices 9.14.29 

— projective 

— similarities and complex numbers 

8.8.4, 9.6 
planet 17.2.1.7 

plate, homogeneous 2.7.5.3, 3.4.2 
Plato 12.5.5.7 
Pliicker’s theorem 16.5.6.3 
Poincare 19.7.1 

— metric 19.6.10 

— model 19.6 

point, affine 2.1.8, 2.4.2.1 
point at infinity 4.0.2, 5.2.S, 5.3, 
10.7.11, 15.1.S.2, 20.1.8 

— , projective 4-1 

polar 6.5.7, 14-5.1, 13.3.2 

— body 11.1.5.1, 12.1.2.6, 12.12.2 

— cone 18.11.15 

— form 8.7.9, S.S.2, IS.1.1, 13.9.8 

— hyperplane 14-5.1, 15.5.1 

— line, see polar 

— of a point with respect to two 

lines 6.5.7 


— quadric 14-^-4 

— regions 18.1.8.6 

— triangle 18.6.12.1, 19.8.3 
polarity 10.7.11, 14.5 

— with respect to a quadric 15.5.1 
pole 14.5.1, 18.1.2.S 

polygon 2.8.13, 10.5, 12.1.1, 18.7.1 
polygonal billiards 9.4 
polyhedra 12 

— , cardboard 12.1.3.2, 12.8 

— , convention about 12.1.6 

— , pictorial representation of 

12.1.3.3 

— , regular 1.8, 12.5 

— , structure of 12.1.5 
polyhedron 12.1.S.1 
polynomial S.S.l, S.S.5 

— map S.S.l, S.S.5 
polyspheric coordinates 20.7 
polytope 12, 12.1.1 

— , area of 12.2 

— , volume of 12.3 

Poncelet continuity principle 7.0.3 

— theorem, great 10.10.4, 10.13.3, 

16.5.5.3, 16.6, 17.6.5 

-, little 17.2.1.6, 17.6.3.6 

positive alternating form 2.7.2.2 

— definite quadratic form 8.1.1, 

11.1.2.8, 11.8.11.2 
positively oriented basis 2.7.2.2 

— oriented frame 2.7.2.10 
power of a point with respect to a 

sphere 10.7.10 
prestressed concrete 11.131 
principal circle 17.2.2.6 

— hyperbolic arc-cosine 0.5 
principle of similar figures 9.6.7 
privileged unit of length 8.8.1 
problems of minimum 10.4 
product of affine spaces 2.2.2 

— , semidirect 2.S.S.7 
proficiency of students 10.6 
projection 2.4.9.6 
projective base 4.4.1 
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— completion 5.0.1, 5.1.S, 7.0.2, 

17.4.1 

— coordinates .^..^.5,6.5.10 

— group 4-5.9, 4.9.5 

— hyperplanes 4-6.2 

— independence .^.6.6 

— isomorphism 4-5.2 

— line 4-5.2, 6.6, 6.8.15 

— model 19.2.5 

— morphism 4-5-2, 4.5.13 

— orthogonal group 8.7.1.1 

— plane 4-5-2 

— point 4-5.2 

— quadric 14-1.1, 14, 18.10.1 

— space 4-1.1, 7.0.1, 7.5.1, 8.9.3, 

9.1.7 

— subspace 4-5-1, 4.6.15, 5.3 
proper motion 9.1-4 

— quadric 14-1.1, 15.1.S.2 
Ptolemaic dynasty 12.5.5.7 
Ptolemy’s theorem 9.7.3.8, 10.9.2 
pullback of a quadratic form 

1S.1.S.9 

punctual mass S.4.5 
punctured plane 9.9.4.3 
pure quaternion 8.9.1, 8.11.IS 
purism 4.1.2 
pyramid 12.2.1 

Pythagorean theorem 9.2.3, 10.2.3 

Q 

quadratic extensions 7.0.6 

— fields 12.6.8 

— form S.S.2, 4.0.5, 8.8.6, 9.5.5, 

11.8.11.2, 13, IS.1.1 

— transformation 16.5.S.2, 16.8.24 
quadric 10.7.4, 4.0.5, 4.1.3.6; see 

also affine quadric, projective 
quadric 

— at infinity 15.1.S.2 

— , group of a 14.7 
—s in Art 4 14.4 

— of revolution 17.9.19 
quantum mechanics 4.0.6 


quartic 9.6.8 
quasi-spherical set 11.7.5 
quasi-symmetric set 11.7.5 
quaternionic projective spaces 9.1.7 
quaternions 0.2, 2.6.4, 4.8.2, 4.9.7, 
8.9, 8.9.1, 8.11.13, 8.12.11, 
12.6.10.2, 18.8.8 
Quetelet 17.3 
quintic equation 12.6.10.3 

R 

radical 13.9.1, 14-1.7, IS.2.1 

— axis 10.7.10.1, 10.10.1 

— center 10.7.10.«, 10.11.1 

— hyperplane 10.7.10.1 
radius of a sphere 10.7.1 

— of curvature, see curvature 

— zero 20.1.8 
Radon 11.7.4.1 

rank of a quadratic form IS.2.1 

— of a quadric 14-1.1, 15.1.S.2 
ratio 2.S.S.9, 8.8.2, 9.5.1 
rational function 4,2.6 

— numbers 0.2, 13.1.4.5 
real affine spaces 2.7 

— numbers 0.2 

— projective spaces 9.1.7 

— proper quadrics 14.3.2 

— vector spaces 7.0.1 
rectifiable curve 9.9.1, 12.11.6.1 
reduced quadric 15.6.1 

Reeb foliation 18.8.7 
reflection 2-4.9.6, 6-4-6, 8.2.9, 

9.2.4, 15.6.6.1, 19.1.2.4 
reflection axis 9.S.4 
regular octagon 11.22 

— pentagon 12.5.5.2, 12.12.4 

— polygon 12.4, 12.4-1 

— polyhedron 1.8, 12.5, 12.12.10 
-, hyperbolic 19.8.26 

— polytope 12.5, 12.5.1, 12.6, 

12.12.7 

— simplex 12.5.4.1, 11.5.8, 12.1.2.5 
relative interior of a convex set 

11.2.8 



Index 


423- 


relativity 19.7.3, 13.1.3.2 
representation 8.12.IS 
restriction of a map 0.1 

— of a quadratic form 1S.1.S.5 
Reuleaux triangle 11.5.9.2 
reversing 9.5.4.2 

reversing similarities 9.5.1 
Ricatti equation 6.8.12 
Riemann sphere 6.8.8, 18.1.3.2, 
4.2.5 

Riemannian geometry 9.10.7, 
18.6.11 

riemannian manifold 9.9.6 
Riemannian manifold 9.10.7, 
18.3.8.6, 18.10.9, 11.3.9 

— manifolds 12.7.5.2 

— metric 19.7.2 

— metrics 13 
right lO.l.S 

right angle 8.7.S.5, 8.1.1.4 

— translation 1.2.6 

— triangles 9.6.8.2 
rigid motions 9.1 
Roger Penrose 1.9.16 
rolling theorems 12.12.14 
roof 10.6.8.1 

Roofs 10.6.8 

root 16.4.1 

rotation 1S.6.S, 8.2.1 

rotation, in three dimensions 9.S.5 

rotation of a Euclidean plane 1.1.5 

— of finite order 1.8.1 
rotation, plane 9.S.4 

— three-dimensional 8.4.1.1, 8.9.5 
ruled surface 6 .8.20 

ruler 10.8.3, 10.9, 10.11.2, 12.4.6 

S 

Salmon’s principle 10.13.14 
scalar product 8.1.1 
Schlafli 12.6 
Schlafli’s theorem 12.6.7 
Schmidt, E. 12.11.5.3 
Schwarz inequality 8.11.7 


screw motion 9.S.5, 9.14.7 
search for maxima 11.8.10.10 
secant 10.1.5 
secant spheres 20.4.2 
second barycentric subdivision 
S.6.5 

— fundamental theorem of 

projective geometry 5.4.8 

— variation formula 9.10.7 
second-order formulas 18.6.13.8 
section of a group action 1.6.6 
sector 8.1.5.4 

segment 6.8.14, 9.9.4.1, S.4.S 
segments of conics 19.8.11 
self-devouring reptile 3.1.9 
self-polar I4.5.4.1, 16.4.10.1 
self-polar simplex 13.5.4.2, 14.5.4 

— triangle 14.8.11,10.13.24 
semiafline map 2.6.2 
semidefinite quadratic form 

11 . 8 . 11.2 

semidirect product 2.S.S.1 
semilinear map 6.4.10, 7.1.2.1, 
7.7.2, 2.6.2 

semimorphism 6.4.9, 7.5.2 
seniisimple Lie group 8.12.IS 

— Lie groups 1.8.7.3 
sending objects to infinity 5.4 
separates 11.4.S 
separation theorems 11.4 
sesquilinear 14.8.12.2 

set, cardinality of a 0.1 
—s difference of 0.1 
seven circle theorem 10.11.7 
shaddock 12.7.5.1 
shell architecture 14.4.6, 15.3.3.3, 
15.3.3.3 

shepherd principle 1.5.8 
shortest path 6.8.14, 9.9.5 

— paths in elliptic spaces 19.1.2.1 

— paths on the sphere 18.4.2 

side of a polyhedron 12.1.5, 12.1.8 

— of a triangle 10.1.2 

sides of a spherical polygon 18.1.S 

— of a spherical triangle 18.6.6 
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— of a triangle 2.^.1 

— of elliptic triangle 19.1.S.1 
sign flips 18.7.15 
signature 13.4.7 

signed area 10.4.5.1 
similarities 8.8, 9.5.1, 9.14.12 
similarities and angles 8.8.5 

— and complexiiications 8.8.6 
similarities, characterization of 

9.5.3 

similarity 8 .8.2 

similarity of triangles 10.2.7 

simple closed curve 12.11.6.1 

— field 8.5 

— Lie groups 12.6.8 

simplex I4.5.4.I, 11.6.1, 2.4.7, 3.7.8 
simplex, see also standard simplex 

12 . 1 . 11.1 

— solid 12.1.2.2 
simplicity of 0{E) 8.5 

— of orthogonal groups 13.6.8 
simply connected 12.7.5.3 

— transitive abelian group 8.7.2.5 

— transitive action 1.4-3 
Simson line 17.4.3.5, 17.8.3.2, 

10.4.5.4, 10.9.7, 10.11.3, 
10.13.27 

simultaneous orthogonalization 
13.5 

sine 8.3.6 

singular 14.1.7, 13.2.1 
singularities of differentiable maps 
12 . 6.8 

sink 18.10.2.3 

six-pointed shaddock 12.7.5.1 
sixth circle, theorem of 10.7.10.3 
skew base field 6.3.3 

— field 4.8.1, 6.1.7, 6.8.13, 6.8.15, 

6.8.18 

— field, finite 1.6.7.3 

— fields 2 
slope 16.7.1.1 
smooth point 11.6.1 

snake that bites its tail 3.1.9 
Sobolev spaces 11.86 


soccer balls 12.4.6 
solid ellipsoid 11.8.9.1 

— simplex 9.12.4.2, 12.1.2.2 
soup bowl 8.10.3 

source 18.10.2.3 

south pHDle 18.1.2.3 

space of generalized spheres 20.1.5 

Spain 1.7.1, 14.4.6 

span 4-^-5, 2.4.2.5 

special affine group 2.7.6 

Spencer-Glazer joint 18.11.16 

sphere 9.5.3.1, 9.7.4, 9.9.4.3, 10.7, 

10.7.1 

sphere, area of the 12.10.4.1 
sphere as a projective quadric 

18.10.1 

— as affine quadric 20.1.2 
sphere, canonical measure on 12.3.2 
sphere circumscribed around the 

regular simplex 11.5.9,1 

— of radius zero 20.1.8 

— S* 4.9.7 

— 5® 4.8.3 

spheres 9.1.7, 18, 1.6.4 
spheres, group structure on 8.9.1 
spheres of non-zero imaginary 
radius 20.1.8 
spheric mirror 9.14.34.3 
spherical conic 18.11.15 

— digon 18.3.8.2 

— geometry 1.8.6 

— harmonics 12.11.9.4 

— helices 9.14-34.3 

— helix 18.3.3 

— quadrilaterals 18.11.12 

— triangle 12.5.5.2, 18.6.2, 19.8.25 

— triangles 18.6 

— trigonometry 12.5.5.2, 18.4.4.1, 

18.6.1 

— zone 12.12.20.2 
spherometer 18.1.1, 18.11.1 
spinor 8.10.3 

spline 15 

stabilizer 9.8.1, 1.5, 1.5.1 
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stabilizer of a regular polygon 

12.4.3 

— of a tile 1.7.5.1 
standard {3,3,5} polytope. 

— {3,4,3} polytope. 

— cocube 12.1.2.5 

— cube 12.1.2.5 

— n-dimensional Euclidean affine 

space 9.1.2 

— projective space 4.1.S.1 

— simplex 12.1.11.1 

— solid simplex 12.1.2.5 

— tile 1.7.2 

star polytopes 12.6.10.5 

— regular poly topes 12.12.8 
star-shaped 11.1.2.4, 11.3.6, 11.7.6, 

11.9.6, 11.9.20 
starting point 9.9.1 
Steiner symmetrization 9.13, 

9.1S.1, 11.1.4, 12.10.10, 12.11.2, 
12 . 12.1 

Steiner-Miiikowski 12.11.3 
Steiiier-Minkowski formula 12.3.5, 
12 . 10.6 

— theorem 12.10.6 
Steiner’s alternative 10.10.3 
stereocomparator 4.7.5 
stereographic projection 4.3.8, 

8.7.6, 11.3.6.5, 18.1.4.1, 

18.1.8.6, 18.9.1, 18.10.2, 
18.10.3.2, 18.11.3, 19.6.11, 

Stewart relation 9.14.S5 
Stokes’ theorem 12.11.4 
straightedge, see ruler 
Strasbourg 10.12.1 
strict triangle inequality, see 
triangle inequality 
strictly convex function 11.8.5 

— convex set 11 .6.4 

— separated II.4.S 

string construction 15.7.16, 17.2.2.5 

— toy 17.6.4 

strip of paper 17.7.1 
strongly ergodic 9.4.4 


submanifold 14.3.8, 15.4.5; see also 
differentiable manifold 
subnormal to a parabola 17.9.18.1 
subsphere 18.1.2.1 
successive differences 12.10.6 
sum, direct 0.2 

sum of angles of a polygon 10.5.2 

— of angles of a triangle 1.8.6, 

10.2.4 

— of squares 11.87 
sun 17.2,1.7 

superosculating conics 16.4-5, 
16.4,10.5, 19.6.8.3 
supporting function 11.8.12.S, 
11.9.14 

— hyperplane 12.1.9, 11.5.1 
surface of constant negative 

curvature 19.6.12 

Swiss Federal Topographical Service 

18.1.8.3 

Sylow subgroups 1.6.7.3 
Sylvester’s law of inertia 13.4.7 

— theorem 9.14.25 

symbol S.S.5, 12.6.1, 12.6.7.4, 

20.1.5 

symmetric group 0.2, 1.2.2, 1.8.4, 
6,3,2, 8.10,2, 12,5,5.6, 12.6.8 
symmetries of quadrics 15.6.8 
symmetry 8.2.9 
symmetry groups 1.8.3.4, 1.8.7.2 
symplectic geometry 11.87 
system of linear equations 2.2.3 

T 

tangent bundle to an elliptic space 

19.1.1.4 

— bundle to hyperbolic space 

19.2.7 

— conics 16.4-5, 16.A.10.2 

— hyperplane 14-1-5-5 

— hyperplaiie to a sphere 10-7-4, 

18-1.2-4 

— of the half-arc 8.7.7.7 

— to a quadric 14-1-5-5 
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— vector 10.7.4, 18.1.2.4, 19.1.1.4 
tangential equation I4.6.I 

— pencil 14.6.2, 16.5.6.1, 17.6 

— quadric I4.6.2 
tensor algebra 8.11.2 
tetracyclic coordinates 20.7.1 
tetrahedron 1.8.3.4, 2.4.7, 6.8.21, 

9.12.4.4, 10.6, 10.13.10 
Thales’ theorem 6.5.5, 2.5.1, 4.4.2 
Thom, Rene 12.6.8 
third-order formulas 18.6.13.8 
three levels, formula of 12.12.20.7 
three-cusped hypocycloid 10.4.5.5, 
10.11.3 

three-sphere 1.2.9, 4.3.6.2, 8.9.1, 
18.1.3.3, 18.8 
tile, standard 1.7.2 
tiUng 1.7.2, 1.7.S, 1.7.4, 1.8, 1.9.6, 
12.6.10.4 

Tiniaeus 12.5.5.7 
Toledo, Spain 1.7.1 
topological closure 5.3.3 

— vector spaces 12.6.8 
topology of convex sets 11.3 

— ofGP(E) 4.5.20 

— of projective spaces 4.6.15, 4.3 

— of quadrics 14.3, 15.4 

— of spheres 18.2 
torsion 4.9.12, 10.8.5.5 
torsor 11.87, 14.8.12.5 

torus 1.7.7.4, 18.8.6, 18.11.20, 
10.12.1, 10.13.22 

— , flat 18.11.17 

— of revolution 18.9.2 

total mean curvature 12.10.9.1 
Tower of Babel 4.7.6 
trajectory of a vector field S.1.2.1 
transition map 4.9.4 
transitive action 1.4-1 
translate 2.2.S 

translation 6.6.2, 2.1.2, 1.2.6, 1.4.2, 
3.1.2.1 

transpose 0 .2, 8.1.8.6 
transverse axis 17.2.1-4 


— Mercator projection 18.1.5.3, 

18.1.8.S 
triality 8.10.2 

triangle 10.1, 10.1.1, 2-4-7, 3.4.2, 
9.12.4.4 

triangle in elliptic spaces 19.1.S.1 

— inequality 9.1.1.1, 18.4, 18.6.10, 

18.11.13, 19.1.2.1, 19.3.2 
triangle, spherical 12.5.5.2, 18.2.8.4, 
18.6 

triangle, hyperbolic 19.3.4 

— polar to one another 18.6.7 
triangular billiards 9.4 
trigonometry 8.7.7.7, 8.7.8, 8.12.8 
two-sheet hyperboloid 15.S.S.S 
two-sphere 1.8, 4.3.6, 8.9.1, 

18.1.3.2 

U 

umbilical locus 9.5.5.1 

— points 9.5.4.21 
underlying vector space 2.1.1 
unicursal curves 16.2.10 
unimodular group 2.7.6 
unit sphere 8.2.7 

unitary basis 11.8.9.2 
universal space of an affine space 3, 
7.6.1 

upper half plane 6.8.9 

— Minkowski area 12.10.9.S 

V 

valency 1.9.IS 

van Kampen theorem 18.2.4.2 
vector field 0.2, S.1.1, 18.2.5.3 

— product 8.9.1 

— spaces, orientation of 2.7.2.2 
vectorialization 2.1.9 
Veronese surface .^.5.9.1 
vertex of a conic 17.1.S 

— of a convex set 11.6.1 

— of a spherical polygon 18.6.6, 

18.7. S 

— of a triangle 10.1.2 
vertical billiard 17.9.2 
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Villarceau circle 18.9.3, 18.11.20, 

10.12.1 

vision I.ix, 4.7.3 

Viviani’s window 12.12.20.6 

Voderberg 1.7.2 

volume 9.12.4.1, 9.12.5, 9.12.7 

— of tetrahedron 9.12.4.4, 10.13.10 

— of balls 9.12.4.6 

— of compact convex sets 12.9.3 

— of ellipsoid 11.8.9.1 

— of parallelepiped 9.7.3.1 

— of polytopes 12.2 

—s on the sphere 18.S.8.1 

— of a spherical simplex 18.3.8.6 

— and Steiner symmetrization 

9.13.4 

von Staudt 6.4.10, 7.0.3 


W 

Wallace line 10.9.7.1 
Wankel engine 9.14.34.6, 12.10.5 
weakly ergodic 9.4.4 
— parallel affine subspaces 2.4-9.1 
web 5.5.8, 5.5.9 
Weyl group 12.6.9 
width 11.5.6.S, 11.5.9.5, 11.9.12, 
12.10.5 

wild vector fields 3.1.2 
wind 18.2.5.4 
wire, piece of 2.7.5.6 
Witt’s theorem 13.7.1, 13.7.8 

X 

Xochimilco, D.F., Mexico 11.155 



Acknowledgements 


We are pleased to acknowledge the permission of various publishers and in¬ 
stitutions to reproduce some of the figures appearing in this book: 

Figs. 9.12.7.2, 12.10.9.2.1, 18.1.1.3, 18.1.6.1: ©Armand Colin Editeur, Paris 

Fig. 10.12.1.2: ©Librairie Arthaud, Paris 

Figs. 1.7.4.8, 1.7.6.13, page 108 (vol. I), 19.6.12.5: 

©BEELDRECHT, Amsterdam/BILD-KUNST, Bonn 1982 

Figs. 1.7.4.1 - 1.7.4.5, 1.7.6.1 - 1.7.6.12: ©Cedic, Paris 

Figs. 12.1.3.2.1, 12.1.3.2.2, 12.6.10.5.3, 12.6.10.5.4: 

©Cambridge University Press, Cambridge 

Figs. 4.3.9.11, 4.3.9.12, 15.3.3.3.2, 18.1.8.6, 20.7.3: 

©Chelsea Publishing Company, New York 

Figs. 12.1.1.5, 12.1.1.6: ©1971 Columbia University Press, New York 

Fig. 18.9.4.2: ©Conference Board of the Mathematical Sciences, Washington 

Figs. 12.5.6.1, 12.5.6.2.1 - 12.5.6.2.4: ©Dover Publications, Inc., New York 

Fig. 18.1.8.7: ©Eyrolles, Paris 

Figs. 4.7.4.1, 4.7.4.2: ©Institut Geographique National, Paris 
(Autorisation no. 99-0175) 

Figs. 1.8.5, 19.6.12.4: ©John Wiley & Sons Inc., New York 

Figs. 14.4.4.6.3, 14.4.6.4, 15.3.3.3.3, 15.3.3.3.4: 

©Karl Kramer Verlag, Stuttgart 

Fig. page 22 (vol. II): ©Prestel-Verlag, Munchen 

Fig. 12.5.5.7: ©Princeton University Press, Princeton 

Fig. 12.10.5.2: ©Vandenhoeck & Ruprecht, Gottingen 



